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(57) Abstract: The present invention is in the field of soybean genetics. More specifically, the invention relates to nucleic acid 
molecules from regions of the soybean genome, which are associated with soybean cyst nematode resistance. The invention also 
relates to proteins encoded by such nucleic acid molecules as well as antibodies capable of recognizing these proteins. The invention 
also relates to nucleic acid markers from regions of the soybean genome, which are associated with soybean cyst nematode resistance. 
Moreover, the invention relates to uses of such molecules, including, transforming soybean cyst nematode resistant soybean with 
constructs containing nucleic acid molecules from regions of the soybean genome, which are associated with soybean cyst nematode 
resistance. Furthermore, the invention relates to the use of such molecules in a plant breeding program. 
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NUCLEIC ACID MOLECULES AND OTHER MOLECULES 
ASSOCIATED WITH SOYBEAN CYST NEMATODE RESISTANCE 

FIELD OF THE INVENTION 
The present invention is in the field of soybean genetics. More specifically, the 
invention relates to nucleic acid molecules from regions of the soybean genome, which 
are associated with soybean cyst nematode (SCN) resistance. The invention also relates 
to proteins encoded by such nucleic acid molecules as well as antibodies capable of rec- 
ognizing these proteins. The invention also relates to nucleic acid markers from regions 
of the soybean genome, which are associated with SCN resistance. Moreover, the inven- 
tion relates to uses of such molecules, including, transforming SCN sensitive soybean 
with constructs containing nucleic acid molecules from regions in the soybean genome, 
which are associated with SCN resistance. Furthermore, the invention relates to the use 
of such molecules in a plant breeding program. 

BACKGROUND OF THE INVENTION 
The soybean, Glycine max (L.) Merril (Glycine max or soybean), is one of the 
major economic crops grown worldwide as a primary source of vegetable oil and protein 
(Sinclair et al, Compendium of Soybean Diseases, 3 rd Ed. APS Press, St. Paul, MN, p. 
106. (1989)). The growing demand for low cholesterol and high fiber diets has also 
increased soybean's importance as a health food. 

Prior to 1940, soybean cultivars were either direct releases of introductions 
brought from Asia or pure line selections from genetically diverse plant introductions. 
The soybean plant was primarily used as a hay crop in the early part of the 19th century. 
Only a few introductions were large-seeded types useful for feed grain and oil production. 
From the mid 1930' s through the 1960's, gains in soybean seed yields were achieved by 
changing the breeding method from evaluation and selection of introduced germplasm to 
crossing elite by elite lines. The continuous cycle of cross hybridizing the elite strains 
selected from the progenies of previous crosses resulted in the modern day cultivars. 
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Over 10,000 soybean strains have now been introduced into the United States 
since the early 1900' s (Bernard et al, United States National Germplasm Collections. In: 
L.D. Hil (ed.), World Soybean Research, pp. 286-289. Interstate Printers and Publ., 
Danville, II. (1976)). A limited number of those introductions form the genetic base of 
cultivars developed from the hybridization and selection programs (Johnson et al, The 
Soybean, Norman Ed., Academic Press, N.Y., pp. 1-73 (1963)). For example, in a survey 
conducted by Specht and Williams, Genetic Contributions, Fehr eds. American Soil 
Association, Wisconsin, pp. 49-73 (1984), for the 136 cultivars released from 1939 to 
1989, only 16 different introductions were the source of cytoplasm for 121 of that 136. 
Certain soybean strains are sensitive to one or more pathogens. One economically 
important pathogen is SCN. 

SCN accounts for roughly 40% of the total disease in soybean and can result in 
significant yield losses (up to 90%). SCN is the most destructive pest of soybean to date 
and accounts for an estimated yield loss of up to $809 million dollars annually. 
Currently, the most cost effective control measures are crop rotation and the use of host 
plant resistance. While breeders have successfully developed SCN resistant soybean 
lines, breeding is both difficult and time consuming due to the complex and polygenic 
nature of resistance. The resistance is often race specific and does not provide stability 
over time due to changing SCN populations in the field, hi addition, many of the resistant 
soybean varieties carry a significant yield penalty when grown in the absence of SCN. 

SCN, Heterodera glycines Ichinohe, was identified on soybeans in the United 
States in 1954 at Castle Hayne, N.C. Winstead, et al, Plant Dis. Rep. 39:9-1 1 (1955). 
Since its discovery the SCN has been recognized as one of the most destructive pests in 
soybean. It has been reported in nearly all states in which soybeans are grown, and it 
causes major production problems in several states, being particularly destructive in the 
Midwestern states. See generally: Caldwell, etal.,Agron. J. 52:635-636 (1960); Rao- 
Arelli et al, Crop. Sci. 25:650-652, (1988); Baltazar et al, Soybean Genet. Newsl. 
79:120-122 (1992); Concibido, et al, Crop. Sci., (1993). For example, sensitive soybean 
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cultivars had 5.7-35.8% lower seed yields than did resistant cultivars on SCN race-3 
infested sites in Iowa. (Niblack et al, Plant Dis. 76:943-948 (1992)). 

Shortly after the discovery of SCN in the United States, sources of SCN resist- 
ance were identified (Ross et al, Plant Dis. Rep. 41:923-924 (1957)). Some lines such as 
Peking and Plant Introduction (PI) PI88788, were quickly incorporated into breeding pro- 
grams. Peking became widely used as a source of resistance due to its lack of agronom- 
ically undesirable traits, with Pickett as the first SCN resistant cultivar released (Brim et 
al, Crop Sci. 6:305 (1966)). The recognition that certain SCN resistant populations could 
overcome resistant cultivars lead to an extensive screen for additional sources of SCN 
resistance. PI88788 emerged as a popular source of race 3 and 4 resistance even though it 
had a cyst index greater than 10% (but less than 20%) against race 4, and Peking and its 
derivatives emerged as a popular source for races 1 and 3. PI437654 was subsequently 
identified as having resistance to all known races and its SCN resistance was backcrossed 
into Forrest. Currently there are more than 130 Pis known to have SCN resistance. 

SCN race 3 is considered to be the prominent race in the Midwestern soybean 
producing states. Considerable effort has been devoted to the genetics and breeding for 
resistance to race 3. While both Peking and PI88788 are resistant to SCN race 3, classical 
genetics studies suggest that they harbor different genes for race 3 resistance (Rao-Arelli 
et al, Crop Sci. 25:650-652 (1988)). Crosses between PI88788(R) and Essex(S) segre- 
gate 9(R): 55(S) in the F 2 population and 1(R): 26(Seg): 37(S) families in the F 3 genera- 
tion, suggesting that resistance to race 3 in PI88788 is conditioned by one recessive and 
two dominant genes, where as Peking and PI90763 resistance is conditioned by one domi- 
nant and two recessive genes. Based on reciprocal crosses, Peking, Forrest, and PI90763 
have genes in common for resistance to SCN race 3 (Rao-Arelli et al, Crop Sci, 28:650- 
652 (1988)). A cross between Peking and PI88788 segregates 13(R):3(S) in the F 2 gen- 
eration, indicating a major difference between the parents for race 3 resistance. Genera- 
tion mean analysis based on four crosses between resistant and sensitive genotypes; A20 
(R), Jack (R), Cordell (R) and A2234 (S), suggests that an additive genetic model is 
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sufficient to explain most of the genetic variation of race 3 SCN resistance in each cross, 
while the analysis of the pooled data indicates the presence of dominant effects as well 
(Mansur et al, Crop Sci. 53:1249-1253 (1993)). This analysis further indicates that race 
3 resistance is probably under the genetic control of three, but not more than four genes. 

RFLP analysis of segregating populations between resistant and sensitive lines; 
PI209332 (R), PI90763 (R), PI88788 (R), Peking (R) and Evan (S), identified a major 
SCN resistance QTL ( rhgV) which maps to linkage group G (Concibido et al, Theor 
Appl. Genet. 93:234-241 (1996)). In this study, rhgl explains 51.4% of the phenotypic 
variation in PI209322, 52.7% of the variation in PI90763, 40.0% of the variation in 
PI88788 and 28.1% of the variation in Peking. This major resistance QTL was assumed 
be one and the same in all of the mapping populations employed. However, as pointed 
out by the authors, it is possible that the genomic interval contains distinct but tightly 
linked QTLs. In a related study using PI209332 as the source of resistance, Concibido et 
al, Crop Sci. 36:1643-1650 (1996), show that a QTL on linkage group G (rhgl) is 
effective against the three SCN races tested, explaining 35% of the phenotypic variation 
to race 1, 50% of the variation to race 3, and 54% of the variation to race 6. In addition to 
the major QTL on linkage group G, 4 other QTLs mapping to linkage groups D, J, L and 
K were identified, with some of the resistance loci behaving in a race specific manner. 

Concibido et al (Crop Sci. 37:258-264 (1997)) found significant association of 
marker C006V to a major QTL on linkage group G ( rhgl) and resistance to race 1, race 3 
and race 6, in Peking and PI90763 (Evan X Peking, Evan X PI90763) and races 3 and 6 in 
PI88788 (Evan X PI88788), in agreement with the previous study based on the P209332 
source of resistance (Concibido et al, Crop Sci. 36:1643-1650 (1996)). The resistance 
locus near C006V was effective against all races tested in all of the resistance sources. 
While statistically significant against all races, this locus accounts for different 
proportions of the total phenotypic variation with the races tested. For example, in 
PI90763 the resistance locus near C006V explains more than three times the phenotypic 
variation against race 1 than against race 3. The variability can be attributed to 
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differences in the genetic backgrounds, variability among the SCN populations or may be 
a reflection of the limited size of the plant populations which were employed. This study 
further identified three additional independent SCN resistance QTLs; one near the RFLP 
marker A378H mapping to the opposite end of linkage group G from C006V (rhgl), one 
near the marker B032V-1 on linkage group J and a third linked to A280Hae-l on linkage 
group N. Comparisons between the different SCN races indicated that some of the 
putative SCN QTLs behave in a race specific manner. 

PI437654 was identified as having resistance to all known races. Based on anal- 
ysis of 328 recombinant inbreed lines (RIL) derived from a cross between PI437654 and 
BSR101, Webb reported six QTLs associated with SCN resistance on linkage groups A2, 
CI, G, M, L25 and L26 (U.S. Patent 5,491,081). An allele on linkage group G, presumed 
to be rhgl, is involved with certain SCN races tested (races 1, 2, 3, 5 and 14), and has the 
largest reported phenotypic effect on resistance to every race, hi contrast, the QTLs on 
linkage groups A2, CI, M, L25 and L26 act in a race specific manner. The QTL on link- 
age group L25 was reportedly involved with four of the five races, while the QTLs on 
linkage groups, A2, CI and L26 were each involved in resistance to two of the five races 
(U.S. Patent 5,491,081). Webb further reports data that the resistance to any of the five 
races is likely to result from the combined effects of the QTL involved in each race (U.S. 
Patent 5,491,081). 

Qui et al. (TheorAppl Genet 95:356-364 (1999)) screened 200 F 2;3 families 
derived from a cross between Peking and Essex and identified RFLP markers which are 
associated with SCN resistance QTLs on linkage groups B, E, I and H. The three QTLs 
on linkage groups B, E and H jointly account for 57.7% of the phenotypic variation to 
race 1, the QTLs on linkage groups H and B account for 21.4% of the variation to race 3, 
while the QTLs on linkage groups I and E are associated with resistance to race 5 ac- 
counting for 14.0% of the phenotypic variation. In contrast to previous mapping studies 
which use Peking as the source of resistance, no significant association was detected to 
the rhgl locus on linkage group G. The authors point out that the marker Bngl22, which 
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has been shown to have significant linkage to rhgl, is not polymorphic in the population 
employed (Concibido et al, Crop Sci. 36: 1643-1650 (1996)). 

It has been reported that the rhgl locus on linkage group G is necessary for the 
development of resistance to any of the SCN races. There have been efforts to develop 
molecular markers to identify breeding lines harboring the rhgl SCN resistant allele. One 
of the most commonly used markers for marker assisted selection (MAS) of rhgl is an 
SSR locus that co-segregates and maps roughly 0.4 cM from rhgl. This SRR marker, 
BARC-Satt_309 is able to distinguish most, if not all, of the SCN sensitive genotypes 
from those harboring rhgl from important sources of resistance such as Peking and 
PI437654. Two simple sequence repeat markers have been reported that can be used to 
select for SCN resistance at the rhgl locus (Concibido et al, TheorAppl Genet 99: 811- 
818 (1999)). Satt_309 was also effective in distinguishing SCN resistant sources PI88788 
and PI209332 in many, but not all, sensitive genotypes, hi particular, Satt_309 can not be 
used for MAS in populations developed from "typical" southern US cultivars (e.g., Lee, 
Bragg and Essex) crossed with resistance sources PI88788 or PI209332. 

Matson et al. have reported a dominant SCN resistance locus, RhgA, which is 
tightly linked to the T locus on linkage group A2 (Matson et al, Crop Sci. 5:447 (1965)). 
The QTL reported by Webb on linkage group A2 maps near the T locus and is 
considered to be Rhg4 (U.S. Patent 5,491,081). Webb concludes that only two loci on 
linkage groups A2 (Rhg4) and G (rhgl) explain the genetic variation to race 3. 

SUMMARY OF THE INVENTION 
The present invention includes and provides a method for the production of a 
soybean plant having an rhgl SCN resistant allele comprising: (A) crossing a first soy- 
bean plant having an rhgl SCN resistant allele with a second soybean plant having an 
rhgl SCN sensitive allele to produce a segregating population; (B) screening the segre- 
gating population for a member having an rhgl SCN resistant allele with a first nucleic 
acid molecule capable of specifically hybridizing to linkage group G, wherein the first 
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nucleic acid molecule specifically hybridizes to a second nucleic acid molecule that is 
linked to the rhgl SCN resistant allele; and, (C) selecting the member for further crossing 
and selection. 

The present invention includes and provides a method of investigating an rhgl 
haplotype of a soybean plant comprising: (A) isolating nucleic acid molecules from the 
soybean plant; (B) determining the nucleic acid sequence of an rhgl allele or part thereof; 
and, (C) comparing the nucleic acid sequence of the rhgl allele or part thereof to a 
reference nucleic acid sequence. 

The present invention includes and provides a method of introgressing SCN re- 
sistance or partial SCN resistance into a soybean plant comprising: performing marker as- 
sisted selection of the soybean plant with a nucleic acid marker, wherein the nucleic acid 
marker specifically hybridizes with a nucleic acid molecule having a first nucleic acid se- 
quence that is physically linked to a second nucleic acid sequence that is located on link- 
age group G of soybean A3244, wherein the second nucleic acid sequence is within 500 
kb of a third nucleic acid sequence which is capable of specifically hybridizing with the 
nucleic acid sequence of SEQ ID NO: 5, 6, complements thereof, or fragments thereof 
having at least 15 nucleotides; and, selecting the soybean plant based on the marker 
assisted selection. 

The present invention includes and provides a method for the production of a 
soybean plant having an RhgA SCN resistant allele comprising: (A) crossing a first soy- 
bean plant having an RhgA- SCN resistant allele with a second soybean plant having an 
RhgA SCN sensitive allele to produce a segregating population; (B) screening the segre- 
gating population for a member having an RhgA SCN resistant allele with a first nucleic 
acid molecule capable of specifically hybridizing to linkage group A2, wherein the first - 
nucleic acid molecule specifically hybridizes to a second nucleic acid molecule linked to 
the RhgA SCN resistant allele; and, (C) selecting the member for further crossing and 
selection. 
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The present invention includes and provides a method of investigating an RhgA 
haplotype of a soybean plant comprising: (A) isolating nucleic acid molecules from the 
soybean plant; (B) determining the nucleic acid sequence of an RhgA allele or part 
thereof; and (C) comparing the nucleic acid sequence of the RhgA allele or part thereof to 
a reference nucleic acid sequence. 

The present invention includes and provides a method of introgressing SCN re- 
sistance or partial SCN resistance into a soybean plant comprising: performing marker as- 
sisted selection of the soybean plant with a nucleic acid marker, wherein the nucleic acid 
marker specifically hybridizes with a nucleic acid molecule having a first nucleic acid se- 
quence that is physically linked to a second nucleic acid sequence that is located on link- 
age group A2 of soybean A3244, wherein the second nucleic acid sequence is within 500 
kb of a third nucleic acid sequence which specifically hybridizes with the nucleic acid 
sequence of SEQ ID NO: 7, complements thereof, or fragments thereof having at least 15 
nucleotides; and, selecting the soybean plant based on the marker assisted selection. 

The present invention includes and provides a substantially purified nucleic acid 
molecule comprising a nucleic acid sequence selected from the group consisting of SEQ 
ED NOs: 5, 6, 8-23, 28-43, complements thereof, and fragments of either. 

The present invention includes and provides a substantially purified first nucleic 
acid molecule with nucleic acid sequence which specifically hybridizes to a second 
nucleic acid molecule having a nucleic acid sequence selected from the group consisting 
of a complement of SEQ ID NOs: 5, 6, 8-23, 28-43. 

The present invention includes and provides a substantially purified nucleic acid 
molecule comprising a nucleic acid sequence selected from the group consisting of SEQ 
ID NOs: 7, 44-47, and 50-53, complements thereof, and fragments of either. 

The present invention includes and provides a substantially purified first nucleic 
acid molecule with nucleic acid sequence which specifically hybridizes to a second 
nucleic acid molecule having a nucleic acid sequence selected from the group consisting 
of a complement of SEQ ID NOs: 50-53. 
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The present invention includes and provides a substantially purified protein or 
fragment thereof comprising an amino acid sequence selected from the group consisting 
of SEQ ID NOs: 1097, 1098, and 1100-1115 and fragments thereof. 

The present invention includes and provides a substantially purified protein or 
fragment thereof comprising an amino acid sequence selected from the group consisting 
of SEQ ID NOs 1099, and 1116-1119 and fragments thereof. 

The present invention includes and provides a transformed plant having a nucleic 
acid molecule which comprises: (A) an exogenous promoter region which functions in a 
plant cell to cause the production of a mRNA molecule; (B) a structural nucleic acid mol- 
ecule encoding a protein or fragment thereof comprising an amino acid sequence selected 
from the group consisting of SEQ ID NOs: 1097, 1100, 1098, 1101, 1102-1115; and (C) a 
3' non-translated sequence that functions in the plant cell to cause termination of trans- 
cription and addition of polyadenylated ribonucleotides to a 3' end of the mRNA 
molecule. 

The present invention includes and provides a transformed plant having a nucleic 
acid molecule which comprises: (A) an exogenous promoter region which functions in a 
plant cell to cause the production of a mRNA molecule; (B) a structural nucleic acid 
molecule encoding a protein or fragment thereof comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOs: 1099, 1116-1119; and (C) a 3' non- 
translated sequence that functions in the plant cell to cause termination of transcription 
and addition of polyadenylated ribonucleotides to a 3' end of the mRNA molecule. 

The present invention includes and provides a transgenic seed having a nucleic 
acid molecule which comprises: (A) an exogenous promoter region which functions to 
cause the production of a mRNA molecule; (B) a structural nucleic acid molecule 
encoding a protein or fragment thereof comprising an amino acid sequence selected from 
the group consisting of SEQ ID NOs: 1097, 1100, 1098, 1101, 1102-1115; and (C) a 3' 
non-translated sequence that functions to cause termination of transcription and addition 
of polyadenylated ribonucleotides to a 3' end of the mRNA molecule. 

The present invention includes and provides a transgenic seed having a nucleic 
acid molecule which comprises: (A) an exogenous promoter region which functions to 
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cause the production of a mRNA molecule; (B) a structural nucleic acid molecule encod- 
ing a protein or fragment thereof comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOs: 1099, 1116-1119; and (C) a 3' non-translated sequence 
that functions to cause termination of transcription and addition of polyadenylated ribo- 
nucleotides to a 3' end of the mRNA molecule. 

DESCRIPTION OF THE FIGURES 
Figure 1 is an amino acid sequence alignment of the leucine rich repeat domain of 

rhgl. 

Figure 2 is an amino acid sequence alignment of the leucine rich repeat domain of 

RhgA. 

DESCRIPTION OF THE SEQUENCE LISTINGS 

The following sequence listings form part of the present specification and are 
included to further demonstrate certain aspects of the present invention. The invention 
may be better understood by reference to one or more of these sequences in combination 
with the detailed description presented herein. 

SEQ ID NOs: 1-7 and 1097-1099 all refer to sequences from the line A3244. 

SEQ ID NO: 1 is sequence ID 515O02_region_G2 from line A3244, and is 
adjacent to the contig containing rhgl. 

SEQ ID NO: 2 is sequence ID 240O17_region_G3 from line A3244, and contains 
the rhgl, v.l four exon gene at coding coordinates 45163-45314, 45450-45509, 46941- 
48763, 48975-49573. The amino acid translation for SEQ ID NO: 2 is SEQ ID NO: 
1097. 

SEQ ID NO: 3 is sequence ID 240O17_region_G3 from line A3244, and contains 
the rhgl, v.2 two exon gene at coding coordinates 46798-48763 and 48975-49573. The 
amino acid translation for SEQ ID NO: 3 is SEQ ID NO: 1098. 
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SEQ ID NO: 4 is sequence ID 318013_region_A3 from line A3244, contains 
the RhgA gene at coding coordinates 111805-113968 and 114684-115204, and has an 
amino acid translation of SEQ ID NO: 1099. 

SEQ ID NO: 5 is sequence ID 240O17_region_G3_8_mRNA, and comprises the 
two rhgl, v.2 exons from the coding sequence portion of SEQ ID NO: 3. 

SEQ ID NO: 6 is sequence ID 240O17_region_G3_8_cds, and comprises the four 
rhgl, v.l exons from the coding sequence portion of SEQ ID NO: 2. 

SEQ ID NO: 7 is sequence ID 318013_region_A3_17_cds, and comprises the 
RhgA coding sequence portion from SEQ ID NO: 4. 

SEQ ID NOs: 8-43 and 1100-1115 all refer to rhgl sequences. 

SEQ ID NO: 8 is sequence ID rhgl_A3244_amplicon from line A3244, contains 
four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1891-3713, and 3925-4523, 
and has an amino acid translation of SEQ ID NO: 1100 and 1097. 

SEQ ID NO: 9 is sequence ID rhgl_A3244_amplicon, contains two rhgl, v.2 
exons at coding coordinates 1748-3713 and 3925-4523 and has an amino acid translation 
of SEQ ID NO: 1 101 and 1098. 

SEQ ID NO: 10 is sequence ID rhgl_peking_amplicon from the line peking, 
contains four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1888-3710, and 
3903-4501, and has an amino acid translation of SEQ ID NO: 1 102. 

SEQ ED NO: 1 1 is sequence ID rhgl_peking_amplicon, contains two rhgl, v.2 
exons at coding coordinates 1745-3710 and 3903-4501, and has an amino acid translation 
of SEQ ID NO: 1103. 

SEQ ID NO: 12 is sequence ID rhgl_toyosuzu_amplicon from the line toyosuzu, 
contains four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1890-3712, and 
3924-4522, and has an amino acid translation of SEQ ID NO: 1104. 

SEQ ID NO: 13 is sequence ID rhgl_toyosuzu_amplicon, contains two rhgl, v.2 
exons at coding coordinates 1747-3712 and 3924-4522, and has an amino acid translation 
of SEQ ID NO: 1105. 
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SEQ ID NO: 14 is sequence ID rhgl_will_amplicon from the line will, contains 
four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1891-3713, and 3925-4523, 
and has an amino acid translation of SEQ ID NO: 1 106. 

SEQ ID NO: 15 is sequence ID rhgl_will_amplicon, contains two rhgl, v.l 
exons at coding coordinates 1748-3713 and 3925-4523, and has an amino acid translation 
of SEQ ID NO: 1107. 

SEQ ID NO: 16 is sequence ID rhgl_a2704_amplicon from the line A2704, 
contains four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1891-3713, and 
3925-4523, and has an amino acid translation of SEQ ID NO: 1108. 

SEQ ID NO: 17 is sequence ID rhgl_a2704_amplicon, contains two rhgl, v.l 
exons at coding coordinates 1748-3713 and 3925-4523, and has an amino acid translation 
of SEQ ID NO: 1109. 

SEQ ID NO: 18 is sequence ID rhgl_noir_amplicon from the line noir, contains 
four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1876-3698, and 3910-4508, 
and has an amino acid translation of SEQ ID NO: 1 1 10. 

SEQ ID NO: 19 is sequence ID rhgl_noir_amplicon, contains two rhgl, v.l 
exons at coding coordinates 1733-3698 and 3910-4508, and has an amino acid translation 
of SEQ ID NO: 1111. ' 

SEQ ID NO: 20 is sequence ID rhgl_lee_amplicon from the line lee, contains 
four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1876-3698, and 3910-4508, 
and has an amino acid translation of SEQ ID NO: 1 1 12. 

SEQ ID NO: 21 is sequence ID rhgl_lee_amplicon, contains two rhgl, v.l exons 
at coding coordinates 1733-3698 and 3910-4508, and has an amino acid translation of 
SEQ ID NO: 1113. 

SEQ ID NO: 22 is sequence ID rhgl_pi200499_amplicon from the line 
PI200499, contains four rhgl, v.l exons at coding coordinates 113-264, 400-459, 1876- 
3698, and 3910-4508, and has an arnino acid translation of SEQ ID NO: 1114. 
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SEQ ID NO: 23 is sequence ID rhgl_ pi200499_amplicon, contains two rhgl, v. 2 
exons at coding coordinates 1733-3698 and 3910-4508, and has an amino acid translation 
of SEQ ID NO: 1115. 

SEQ ID NO: 24 is sequence ID 240O17_region_G3_forward_l, is a primer that 
hybridizes to coordinates 45051-45077 on contig 240017_region_G3 before the start 
codon, and can be used with SEQ ID NO: 25. 

SEQ ID NO: 25 is sequence ID 240O17_region_G3_reverse_l, is a primer that 
hybridizes to coordinates 47942-47918 on contig 240017_region_G3, and can be used 
with SEQ ID NO: 24. 

SEQ ID NO: 26 is sequence ID 240O17_region_G3_forward_2, is a primer that 
hybridizes to coordinates 47808-47831 on contig 240017_region_G3, and can be used 
with SEQ ID NO: 27. 

SEQ ID NO: 27 is sequence ID 240O17_region_G3_reverse_2, is a primer that 
hybridizes to coordinates 49553-49531 of contig 240017_region_G3 prior to the stop 
codon, and can be used with SEQ ID NO: 26. 

Primers given by SEQ ID NOs: 24-27 are used to create the amplicons of SEQ ID 
NOs: 8-23. The final 22 bases are added to the actual amplicons in order to simulate the 
rest of the gene to the stop codon, in order to allow complete translation. 

SEQ ID NO: 28 is sequence ID rhgl_A3244_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 8. 

SEQ ID NO: 29 is sequence ID rhgl_peking_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 10. 

SEQ ID NO: 30 is sequence ID rhgl_toyosuzu_amplicon_cds, which is the 
coding sequence portion of SEQ ID NO: 12. 

SEQ ID NO: 31 is sequence ID rhgl_will_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 14. 

SEQ ID NO: 32 is sequence ID rhgl_a2704_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 16. 
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SEQ ID NO: 33 is sequence ID rhgl_noir_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 18. 

SEQ ID NO: 34 is sequence ID rhgl_lee_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 20. 

SEQ ID NO: 35 is sequence ID rhgl_pi200499_amplicon_cds, which is the 
coding sequence portion of SEQ ID NO: 22. 

SEQ ID NO: 36 is sequence ID rhgl_A3244_amplicon_cds_2, which is the 
coding sequence portion of SEQ ID NO: 9. 

SEQ ID NO: 37 is sequence ID rhgl_peking_amplicon_cds_2, which is the 
coding sequence portion of SEQ ID NO: 11. 

SEQ ID NO: 38 is sequence ID rhgl_toyosuzu_amplicon_cds_2, which is the 
coding sequence portion of SEQ ID NO: 13. 

SEQ ID NO: 39 is sequence ID rhgl_will_amplicon_cds_2, which is the coding 
sequence portion of SEQ ID NO: 15. 

SEQ ID NO: 40 is sequence ID rhgl_a2704_amplicon_cds_2, which is the 
coding sequence portion of SEQ ID NO: 17. 

SEQ ID NO: 41 is sequence ID rhgl_noir_amplicon_cds_2, which is the coding 
sequence portion of SEQ ID NO: 19. 

SEQ ID NO: 42 is sequence ID rhgl_lee_amplicon_cds_2, which is the coding 
sequence portion of SEQ ID NO: 21. 

SEQ ID NO: 43 is sequence ID rhgl_pi200499_amplicon_cds_2, which is the 
coding sequence portion of SEQ ID NO: 23. 

SEQ ID NOs: 44-53 and 1116-1119 all refer to RhgA sequences 

SEQ ID NO: 44 is sequence ID rhg4_a3244_amplicon from the line A3244, 
contains RhgA at coding coordinates 79-2242 and 2958-3478, is made using SEQ ID 
NOs: 48 and 49, and has an amino acid translation of SEQ ID NO: 1 1 16 and 1099. 
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SEQ ID NO: 45 is sequence ID rhg4_Minsoy_amplicon from the line Minsoy, 
contains RhgA at coding coordinates 79-2242 and 2958-3478, is made using SEQ ID 
NOs: 48 and 49, and has an amino acid translation of SEQ ID NO: 1117. 

SEQ ID NO: 46 is sequence ID rhg4_Jack_amplicon from the line Jack, contains 
RhgA at coding coordinates 79-2242 and 2958-3478, is made using SEQ ID NO: 48 and 
49, and has an amino acid translation of SEQ ID NO: 1118. 

SEQ ID NO: 47 is sequence ID rhg4_peking_amplicon from the line Peking, 
contains RhgA at coding coordinates 79-2242 and 2958-3478, is made using SEQ ID 
NOs: 48 and 49, and has an amino acid translation of SEQ ID NO: 1 1 19. 

SEQ ID NO: 48 is sequence ID 318013_region_A3_forward, hybridizes to 
coordinates 111727-111756 of contig 318013_region_A3, and is a primer used with SEQ 
ID NO: 49 to create RhgA amplicons. 

SEQ ED NO: 49 is sequence ID 318013_region_A3_reverse, hybridizes to 
coordinates 115206-115177 of contig 318013_region_A3, and is a primer used with SEQ 
ID NO: 48 to create RhgA amplicons. 

SEQ ID NO: 50 is sequence ID rhg4_A3244_amplicon_cds, which is the coding 
sequence portion of SEQ ED NO: 44. 

SEQ ED NO: 51 is sequence ED rhg4_Minsoy_amplicon_cds, which is the coding 
sequence portion of SEQ ED NO: 45. 

SEQ ED NO: 52 is sequence ED rhg4_Jack_amplicon_cds, which is the coding 
sequence portion of SEQ ED NO: 46. 

SEQ ED NO: 53 is sequence ID rhg4_peking_amplicon_cds, which is the coding 
sequence portion of SEQ ID NO: 47. 

SEQ ID NO: 1120 is sequence ID consensusLRR, which is a consensus sequence 
for the LRR repeats shown in Figures 1 and 2. 

SEQ ED NO: 1121 is sequence ED rhglLRR, which is the amino acid sequence of 
the LRR domain shown in Figure 1. 
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SEQ ID NO: 1 122 is sequence ED Rhg4LRR, which is the amino acid sequence 
of the LRR domain shown in Figure 2. 

SEQ ID NO: 1123 is sequence ID 240O17_region_G3_forward_l_b, which is an 
alternate primer that hybridizes to coordinates 45046-45072 on contig 240017_region_G3 
before the start codon, and which can be used with SEQ ED NO: 25. 

Table 1 below provides further information on the sequences described herein. 

In table 1, for all rows, "Seq Num" refers to the corresponding SEQ ED NO in the 
sequence listing. 

For rows with SEQ ED NOs: 1-53 and 1120-1123 "Seq ED" refers to the name of 
the SEQ ED NO given in the "Seq Num" column. 

For rows with SEQ ED NOs: 2-4, 8-23, and 44-47 "Coding Sequence" refers to 
the coordinates of the coding portion of the SEQ ED NO given in the "Seq Num" column, 
and "AA" refers to the SEQ ED NO that is the amino acid translation of the SEQ ED NO 
given in the "Seq Num" column. 

For rows with SEQ ED NOs: 24-27 and 1123, "Primer location on 
240017_region_G3" refers to the coordinates of the 240017_region_G3 contig to which 
the SEQ ED NO given in the "Seq Num" column hybridizes. 

For rows with SEQ ED NOs: 48 and 49, "Primer location on 318013_region_A3" 
refers to the coordinates of the 318013jregion_A3 contig to which the SEQ ED NO given 
in the "Seq Num" column hybridizes. 

For rows with SEQ ED NOs: 54-400, "Seq ED" refers to the names of amplicon 

sequences. Within the Seq ED is the " " (double length underscore) symbol. The name 

before this symbol refers to the name of the contig in which the amplicon is found, and 
the numbers after this symbol refer to the nucleotide location of the SSR on the contig. 

For rows with SEQ ED NOs: 401-1096, "Seq ED" refers to the names of primer 
sequences used in PCR to generate the amplicon sequences in table 1. For these rows, the 
"Seq ED" name contains the same name as the amplicon that is generated by the pair of 
primers of which the SEQ ED NO referred to in the first column is a member. The "Seq 
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ID" name also contains either "Forward" or "Reverse," which indicates the orientation of 
the primer. For these sequences, "location of primer on contig start" and "location of 
primer on contig end" refer, respectively, to the first and last base number of the contig on 



which the primer aligns. 

TABLE 1 



Seq Num 


Seq ID 






1 


515O02_region_G2 







Seq Num 


Seq ID 


Coding Sequence 


AA No. 


2 


240O17_region_G3 


45163-45314,45450-45509,46941- 
48763,48975-49573 


1097 


3 


240O17_region_G3 


46798-48763,48975-49573 


1098 


4 


318Q13_region_A3 


111805-113968,114684-115204 


1099 



Seq Num 


Seq ID 






5 


240O17_region_G3_8_mRNA 






6 


240O17_region_G3_8_cds 






7 


31801 3_region_A3_17_cds 







Seq Num 


Seq ID 


Coding Sequence 


AA No. 


8 


rhgl_A3244_amplicon 


1 13-264,400-459, 1 89 1-37 13,3925-4523 


1100 


9 


rhgl_A3244_amplicon 


1748-3713,3925-4523 


1101 


10 


rhg l_peking_amplicon 


113-264,400-459,1888-3710,3903-4501 


1102 


11 


rhgl_peking_amplicon 


1745-3710,3903-4501 


1103 


12 


rhg l_toyosuzu_amplicon 


1 1 3-264,400-459, 1 890-37 12,3924-4522 


1104 


13 


rhg l_toyosuzu_amplicon 


1747-3712,3924-4522 


1105 


14 


rhgl_will_amplicon 


113-264,400-459,1891-3713,3925-4523 


1106 


15 


rhg l_will_amplicon 


1748-3713,3925-4523 


1107 


16 


rhgl_a2704_amplicon 


1 13-264,400-459, 1 89 1-37 13,3925-4523 


1108 


17 


rhg l_a2704_amplicon 


1748-3713,3925-4523 


1109 


18 


rhg l_noir_amplicon 


113-264,400-459,1876-3698,3910-4508 


1110 


19 


rhg l_noir_amplicon 


1733-3698,3910-4508 


1111 


20 


rhgl_lee_amplicon 


113-264,400-459,1876-3698,3910-4508 


1112 


21 


rhgl_lee_amplicon 


1733-3698,3910-4508 


1113 


22 


rhg l_pi200499_amplicon 


113-264,400-459,1876-3698,3910-4508 


1114 


23 


rhgl_pi200499_amplicon 


1733-3698,3910-4508 


1115 



Seq Num 


Seq ID 


Primer location on 240O17_region_G3 




24 


240O17_region_G3_forward_l 


45051-45077 




25 


2400 17_region_G3_reverse_l 


47942-47918 




26 


240O17_region_G3_forward_2 


47808-47831 




27 


2400 1 7_region_G3_reverse_2 


49553-49531 





Seq Num 


Seq ID 






28 


rhgl_A3244_amplicon_cds 






29 


rhgl_peking_amplicon_cds 






30 


rhg l_toyosuzu_amplicon_cds 






31 


rhgl_will_amplicon_cds 
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Seq Num 


Seq ID 






32 


rhgl_a2704_amplicon_cds 






33 


rhg l_noir_amplicon_cds 






34 


rhg l_lee_amplicon_cds 






35 


rhg l_pi200499_amplicon_cds 






36 


rhgl_A3244_amplicon_cds_2 






37 


rhg l_peking_amplicon_cds_2 






38 


rhgl_toyosuzu_amplicon_cds_2 






39 


rhg l_will_amplicori_cds_2 






40 


rhgl_a2704_amplicon_cds_2 






41 


rhgl_noir_amplicon_cds_2 






42 


rhg l_lee_amplicon_cds_2 






43 


rhgl_pi200499_amplicon_cds_2 








Seq Num 


Seq ID 


Coding Sequence 


AA No. 


44 


rhg4_a3244_amplicon 


79-2242,2958-3478 


1116 


45 


rhg4_Minsoy_amplicon 


79-2242,2958-3478 


1117 


46 


rhg4_Jack_amplicon 


79-2242,2958-3478 


1118 


47 


rhg4_peking_amplicon 


79-2242,2958-3478 


1119 




Seq Num 


Seq ID 


Primer location on 318013_region_A3 




48 


31801 3_region_A3_forward 


111727-111756 




49 


3 1 SO 1 3_region_A3_reverse 


115206-115177 





Seq Num 


Seq ID 






50 


rhg4_A3244_amplicon_cds 






51 


rhg4_Minsoy_amplicon_cds 






52 


rhg4_Jack_amplicon_cds 






53 


rhg4_peking_amplicon_cds 







Seq Num 


Seq ID 






54 


240O17_region_G3_289711_ll 






55 


240O17_region_G3_236585_14 






56 


240O17_region_G3_168772_13 






57 


240O17_region_G3_332420_21 






58 


240O17_regionG3_228126_18 






59 


240O17_region_G3_139723_l 1 






60 


240O17_region_G3_280585_14 






61 


240O17_region_G3_70509_14 






62 


240O17_region_G3_50537_17 






63 


240O17_region_G3_23 1556_17 






64 


240O17_region_G3_l 17057_1 1 






65 


240O17_region_G3_23092_13 






66 


240O17_region_G3_297741_14 






67 


240017 region_G3_206502_14 






68 


240O17_region_G3_221223_13 






69 


240O17_region_G3_169084_14 






70 


2400 17_region_G3_9489 1_14 






71 


240O17_region_G3_28 1 852_6 1 






72 


240O17_region_G3_46583_12 






73 


2400 17_region_G3_306835_l 3 






74 


2400 17_region_G3_8547 1_12 






75 


240O17_region_G3_257208_12 
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Seq Num 


Seq ID 






76 


2400 17_region_G3_150390_17 






77 


2400 17_region_G3_34697_75 






78 


240O17_region_G3_150374_13 






79 


2400 17_region_G3_405 13_22 






80 


240O17_region_G3_268602_14 






81 


2400 17_region_G3_25357_13 






82 


2400 17_region_G3_l 37548_1 3 






83 


240O17_region_G3_13913 1_13 






84 


240O17_region_G3_203855_12 






85 


240O17_region_G3_199049_15 






86 


2400 1 7_region_G3_320907_l 2 






87 


240O17_region_G3_16407_17 






88 


2400 17_region_G3_2065 1 6_1 7 






89 


240O17_region_G3_264495_13 






90 


240O17_region_G3_156785_13 






91 


240O17_region_G3_187 129_12 






92 


240O17_region_G3_214106_13 






93 


240O17_region_G3_149013_12 






94 


240O17_region_G3_326352_16 






95 


240O17_region_G3_278962_12 






96 


240O17_region_G3_256930_13 






97 


240O17_region_G3_29646_14 






98 


240O17_region_G3_29618_13 






99 


240O17_region_G3_108561_14 






100 


240O17_region_G3_143975_14 






101 


2400 1 7_region_G3_10843 1_20 






102 


240017jegion_G3_281764_ll 






103 


2400 17_region_G3__13005 8_1 5 






104 


240O17_region_G3_3 10590_52 






105 


240O17_region_G3_313405_14 






106 


2400 17_region_G3_302 190_1 3 






107 


2400 17_region_G3_225343_17 






108 


240O17_region_G3_208823_14 






109 


240O17_region_G3_74285_l 1 






110 


240017 region_G3_109052_16 






111 


240O17_region_G3_6395_12 






112 


240O17_region_G3_244905_16 






113 


2400 1 7_region_G3_244956_l 3 






114 


2400 17_region_G3_l 17220_1 3 






115 


240O17_region_G3_134707_14 






116 


240O17_region_G3_35078_13 






117 


240O17_region_G3_210506_16 






118 


1240O17_region_G3_l 16961_26 






119 


2400 17_region_G3_5 1073_1 3 






120 


240O17_region_G3_55291_15 






121 


2400 17_region_G3_22965 1_1 8 






122 


2400 1 7_region_G3_303308_l 9 






123 


240O17_region_G3_168373_20 






124 


240O17_region_G3_253333_17 






125 


240O17_region_G3_5791_13 






126 


240O17_region_G3_206841_19 






127 


2400 17_region_G3_202827_12 






128 


2400 1 7_region_G3_322656_l 3 
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Seq Num 


Seq ID 






129 


240O17_region_G3_l 1 1841_14 






130 


240O17_region_G3_192719_13 






131 


240O17_region_G3_195630_17 






132 


240O17_region_G3_69999_13 






133 


240O17_region_G3_l 1 176_13 






134 


240O17_region_G3_228643_13 






135 


240O17_region_G3__88478_19 






136 


240O17_region_G3__108950_13 






137 


240O17_region_G3_121054_14 






138 


240O17_region_G3__188337_14 






139 


240O17_region_G3_255944_21 






140 


2400 17_region_G3_2195 1 8_14 






141 


240O17_regior>_G3_235601_15 






142 


240O17_region_G3_301529_13 






143 


240O17_region_G3_94795_14 






144 


240O17_region_G3_46703_23 






145 


240O17_region_G3_59616_14 






146 


240O17_region_G3_296933_15 






147 


240O17_region_G3_192428_17 






148 


240O17_region_G3_191490_14 






149 


240O17_region_G3_201 1 15_1 1 






150 


2400 17_region_G3_72882_15 






151 


2400 1 7_region_G3_695 14_1 3 






152 


240O17_region_G3_37699_47 






153 


240O17_regioii_G3_11301_29 






154 


240O17_region_G3_141875_12 






155 


240O17_region_G3_98090_18 






156 


240O17_region_G3_43298__35 






157 


240017 region G3 262094 11 






158 


240O17_region_G3_262079_15 






159 


240O17_region_G3_59090_12 






160 


2400 17_region_G3_245723_13 






161 


240O17_region_G3_194628_54 






162 


2400 17_region_G3_4566_l 6 






163 


240O17_region_G3_96209_14 






164 


240O17_region_G3_248715_17 






165 


2400 17_region_G3_7 14 10_40 






166 


240O17_region_G3_226519_13 






167 


2400 17_region_G3_l 1282_19 






168 


240O17_region_G3_170504_12 






169 


2400 17_region_G3_40864_14 






170 


240O17_region_G3_13529_14 






171 


240O17_region_G3_22858_14 






172 


240O17_region_G3_30921 1_13 






173 


240O17_region_G3_55568_26 






174 


240O17_region_G3_73238_16 






175 


2400 17_region_G3_52488_19 






176 


31801 3_region_A3_47 1 5 1 8_14 






177 


3 1 8013_region_A3_23 1599_23 






178 


3 18013_region_A3_375912_13 






179 


318O13_region_A3_180013_12 






180 


318O13_region_A3_171606_14 






181 


3 1 8013_region_A3_416256_13 
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Seq Num 


Seq ID 






182 


31801 3_region_A3_23 1 395_15 






183 


3 18O13_region_A3_5502_47 






184 


318O13_region_A3_93061_14 






185 


3 1 801 3_region_A3_l 1 1 684_19 






186 


3 1 8013_region_A3_69328_14 






187 


318013_region_A3_36529_17 






188 


318013_region_A3_139128_12 






189 


3 1 8013_region_A3_495674_13 






190 


3 1 80 13_region_A3_l 87577_13 






191 


318O13_regionA3_453036_14 






192 


3 1 80 13_region_A3_37404 1_13 






193 


3 18013_region_A3_3412_l 1 






194 


318013 region A3_276495_28 






195 


3 1 80 1 3_region_A3_15 1 839_17 






196 


318013_region_A3_292912_12 






197 


3 18O13_region_A3_104560_12 






198 


3 1 80 1 3_region_A3_65 1 93_1 1 






199 


318013_region_A3_l 10573_70 






200 


3 1 80 1 3_region_A3_65 1 17_12 






201 


3 18O13_region_A3_490837_16 






202 


3 1 801 3_region_A3_l 07448_1 1 






203 


31801 3_region_A3_3 3 1_23 






204 


318O13_region_A3_193470_13 






205 


318O13_region_A3_183305_14 






206 


318013 region_A3_5505D_14 






207 


3 18013_region_A3_224693_21 






208 


3 1 SO 1 3_region_A3_2072 1 6_1 2 






209 


318013_region_A3_4654_22 






210 


3 1 80 1 3_region_A3_408959_l 3 






211 


318013_region_A3_132288_22 






212 


3 18O13_region_A3_292822_20 






213 


3 1 8013_region_A3_3 1 1076_12 






214 


318O13_region_A3_509623_13 






215 


318013 region A3 190404_14 






216 


318013 region_A3_164916_15 






217 


318013 region A3_21028_13 






218 


3 1 8O13_region_A3_208012_17 






219 


31801 3_region_A3_484089_14 






220 


318O13_region_A3_332780_17 






221 


3 18O13_region_A3_480137_37 






222 


31801 3_region_A3__44 1 056_14 






223 


3 18013_region_A3_77486_l 1 






224 


318013_region_A3_272468_ll 






225 


3 1 8013_region_A3_4253 19_17 






226 


3 18013_region_A3_413879_3 1 






227 


31801 3_region_A3_80477_64 






228 


31801 3_region_A3_277272_50 






229 


3 1 8O13_region_A3_509642_13 






230 


318013_regionA3_321771_14 






231 


318013_region_A3_26788_12 






232 


318O13_region_A3_262706_16 






233 


31801 3_region_A3_243928_16 






234 


31801 3_region_A3_23246_14 
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235 


318O13_region_A3__165406_12 






236 


31801 3_region_A3_486294_14 






237 


31801 3_region_A3_46754_12 






238 


3 18013_region_A3_381 1 16_15 






239 


3 1 80 13_region_A3_350369_l 1 






240 


318013_region_A3_138841_13 






241 


3 1 8013_region_A3__12158_14 






242 


3 18013_region_A3_3 15368_13 






243 


3 1 8O13_region_A3_307549_13 






244 


318013_region_A3_159857_14 






245 


3 1 8O13_region_A3_14055 1_15 






246 


3 18013_region_A3_279869_l 1 






247 


3 1 80 13_region_A3_78292_35 






248 


318O13_region_A3_185019_12 






249 


318O13_region_A3_409164_13 






250 


318013_region_A3_75392_14 






251 


3 1 8013_region_A3_23 1320_12 






252 


3 1 801 3_region_A3_3 8 1 102_14 






253 


31S013_region_A3_491826_15 






254 


318013_region_A3_56365_21 






255 


3 18013_region_A3_372628_15 






256 


3 1 80 13_region_A3_302609_l 1 






257 


3 1 8013_region_A3_341 804_1 1 






258 


318O13_region_A3_217037_l 1 






259 


3 1 80 1 3_region_A3_264929_6 8 






260 


31S013_region_A3__55499_12 






261 


3 1 80 1 3_region_A3_295634_14 






262 


3 1 80 13_region_A3_26935 8_1 5 






263 


3 1 8O13_region_A3_457009_24 






264 


318013_region_A3_176598_14 






265 


318013_region_A3_278266_12 






266 


3 1 80 13_region_A3_39 1 8 10_12 






267 


3 1 80 13_region_A3_269485_15 






268 


318013 region_A3_359247_17 






269 


318013 region_A3_315094_13 






270 


3 18O13_region_A3_307823_13 






271 


318013_region_A3_248588_15 






272 


3 18013_region_A3_252426_85 






273 


3 1 801 3_region_A3_5 1 33 14_1 6 






274 


3 1 8013_region_A3_681 83_14 






275 


3 1 801 3_region_A3_47 1 19 1_1 3 






276 


3 1 8013_region_A3_163547_l 8 






277 


318013_region_A3_417867_15 






278 


318013_region_A3_332465_14 






279 


3 18O13_region_A3_207697_14 






280 


318013_region_A3_277229_43 






281 


3 1 8013_region_A3_36366_l 1 






282 


3 1 801 3_region_A3_9 1970_12 






283 


318013_region_A3_211533_ll 






284 


3 1 8O13_region_A3_336301_l 1 






285 


318O13_region_A3_441603_14 






286 


3 18013_region_A3_468354_15 






287 


3 1 80 13_region_A3_l 88983_1 8 
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288 


318O13_region_A3_115502_17 






289 


3 18O13_region_A3_163006_13 






290 


3 18013_region_A3_l 19283_14 






291 


3 1 80 1 3_region_A3_49 1 126_1 1 






292 


3 1 8013_region_A3_995 12_21 






293 


318O13_region_A3_280291_17 






294 


3 1 8013_region_A3_138443_19 






295 


318013_region_A3_l 15973_14 






296 


318013_region_A3_329977_14 






297 


318O13_region_A3_205203_14 






298 


3 1 8013_region_A3_153 1 14_12 






299 


318013_region_A3_34581_13 






300 


3 1 80 1 3_region_A3_292577_19 






301 


3 1 8013_region_A3_44539 1_20 






302 


318O13_region_A3_350540_17 






303 


318013_region_A3_453879_15 






304 


3 1 80 13_region_A3_20 1246_1 3 






305 


31801 3_region_A3_326020_l 3 






306 


3 1 80 13_region_A3_503 80 1_14 






307 


318O13_region_A3_302400_52 






308 


318013_region_A3_448857_15 






309 


3 1 80 13_region_A3_48364_14 






310 


3 1 SO 1 3_region_A3_25 1 804_48 






311 


318013_region_A3_382583_13 






312 


318013_region_A3_124737_14 






313 


318013_region_A3_124766_13 






314 


318013_region_A3_461351_16 






315 


318013_region_A3_64953_19 






316 


318013_region_A3_366586_J3 






317 


3 1 8013_region_A3_46 190_15 






318 


3 1 8013_region_A3_8 1016_1 1 






319 


3 1 80 1 3_region_A3_l 34426_14 






320 


318013_region_A3_292724_14 






321 


318013 region A3_187096_17 






322 


31801 3_region_A3_3 8 1 693_1 3 






323 


3 1 80 13_region_A3_36 1286_33 






324 


318013_region_A3_482668_14 






325 


3 1 8O13_region_A3_128002_12 






326 


31801 3_region_A3_499270_14 






327 


318O13_region_A3_231650_12 






328 


3 1 801 3_region_A3_l 9985 1_1 3 






329 


318013_region_A3_324629_13 






330 


318O13_region_A3__374190_19 






331 


318O13_region_A3_460603_13 






332 


318O13_region_A3_108681_14 






333 


318013_region_A3_459791_47 






334 


3 18O13_region_A3_4257_20 






335 


318O13_region_A3_238810_14 






336 


318013_region_A3_245817_14 






337 


3 1 801 3_region_A3_245956_14 






338 


3 18013_region_A3__74148_14 






339 


3 18O13_region_A3_74089_15 






340 


318013_region_A3_241686_12 
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341 


3 1 801 3_region_A3_47476_12 






342 


318O13_region_A3_164550_12 






343 


318O13_region_A3_101255_15 






344 


5 15O02_region_G2_161 89_1 1 






345 


5 15O02_region_G2_71925_13 






346 


515O02_region_G2_4707_12 






347 


5 15O02_region_G2_l 1 8904_1 8 






348 


5 15O02_region_G2_13655_17 






349 


5 15O02_region_G2_53900_13 






350 


515002 region_G2_8079_14 






351 


515O02_region_G2_9969_28 






352 


5 15O02_region_G2_72308_77 






353 


5 15O02_region_G2_99475_19 






354 


515O02_region_G2_118615_18 






355 


5 15O02_region_G2_l 19001_46 






356 


5 15O02_region_G2_l 18958_43 






357 


5 15O02_region_G2_17 197_13 






358 


515O02_region_G2_105163_29 






359 


5 15O02_region_G2_l 1 1335_13 






360 


5 15O02_region_G2_106396_13 






361 


5 15O02_region_G2_59229_17 






362 


5 15O02_region_G2_73795_20 






363 


5 15O02_region_G2_85664_20 






364 


5 15O02_region_G2_36921_17 






365 


515O02_region_G2_124150_19 






366 


515O02_region_G2_5089_14 






367 


5 15O02_region_G2_58221_15 






368 


5 15O02_region_G2_96139_14 






369 


5 15O02_region_G2„70595_13 






370 


515O02_region_G2_4340_15 






371 


5 15O02_region_G2_90417_l 1 






372 


5 15O02_region_G2_497 1 1_17 






373 


5 15O02_region_G2_63053_13 






374 


515002 region G2 63076 14 






375 


5 15O02_region_G2_44442_12 






376 


515002 region_G2_44422_19 






377 


515002 region G2 44158_19 






378 


515O02_region_G2_44141_17 






379 


5 15O02_region_G2_90762_17 






380 


515O02_region_G2_106241_14 






381 


5 15O02_region_G2_109676_12 






382 


5 15O02_region_G2_86242_14 






383 


5 15O02_region_G2_83 109_12 






384 


5 15O02_region_G2_10461_15 






385 


515O02_region_G2_67608_15 






386 


5 15O02_region_G2_63275_46 






387 


5 15O02_region_G2_62405_14 






388 


5 15O02_region_G2_33563_12 






389 


5 15O02_region_G2_33 146_14 






390 


5 15O02_region_G2_102179_29 






391 


5 15O02_region_G2_2646_15 






392 


5 15O02_region_G2_76652_24 






393 


5 15O02_region_G2_66280_14 
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394 


515O02_region_G2_54768_13 






395 


5 15O02_region_G2_62580_14 






396 


5 15O02_region_G2_34598_55 






397 


5 15O02_region_G2_77680_13 






398 


515O02_region_G2_77693_12 






399 


515O02_region_G2_97392_14 






400 


5 15O02_region_G2_97359_15 







Seq Num 


Seq ID 


location of 
primer on 
contig start 


location of 
primer on 
contig end 


401 


240017 region G3_289711_ll_Forward_Primer 


289637 


289661 


402 


240O17_region_G3_28971 1_1 l_Reverse_Primer 


289756 


289732 


403 


240017 region G3_236585_14_Forward_Primer 


236511 


236535 


404 


240O17_region_G3_236585_14_Reverse_Primer 


236638 


236614 


405 


240O17_region_G3_168772_13_Forward_Primer 


168683 


168707 


406 


2400 17_region_G3_168772_l 3_Reverse_Primer 


168811 


168786 


407 


240O17_region_G3_332420_21_Forward_Primer 


332375 


332399 


408 


240O17_region_G3_332420_21_Reverse_Primer 


332505 


332481 


409 


2400 1 7_region_G3_228 1 26_1 8_For ward_Primer 


228048 


228072 


410 


2400 17_region_G3_228 126_1 8_Reverse_Primer 


228182 


228158 


411 


240O17_region_G3_139723_ll_Forward_Primer 


139666 


139690 


412 


2400 17_region_G3_139723_l l_Reverse_Primer 


139802 


139778 


413 


240O17_region_G3_280585_14_Forward_Primer 


280524 


280550 


414 


240O17_region G3_280585_14_Reverse_Primer 


280661 


280637 


415 


240O17_region_G3_70509_14_Forward_Primer 


70478 


70502 


416 


240O17_region_G3_70509_14_Reverse_Primer 


70616 


70592 


417 


2400 1 7_region_G3_50537_ 1 7_For ward_Primer 


50455 


50479 


418 


240O17_region_G3_50537_17_Reverse_Primer 


50593 


50569 


419 


240O17_region_G3_231556_17_Forward_Primer 


231468 


231492 


420 


240017 region_G3_231556_17_Reverse_Primer 


231606 


231582 


421 


240017 region G3_l 17057_1 l_Forward_Primer 


117029 


117053 


422 


240017 region G3_117057_ll_Reverse_Primer 


117169 


117145 


423 


240017 region G3_23092_13_Forward_Primer 


23010 


23034 


424 


240O17_region_G3_23092_13_Reverse_Primer 


23151 


23127 


425 


240O17_region_G3_297741_14_Forward_Primer 


297680 


297704 


426 


240017 region G3 297741_14_Reverse_Primer 


297823 


297799 


427 


240O17_region_G3_206502_14_Forward_Primer 


206456 


206480 


428 


240O17_region_G3_206502_14_Reverse_Primer 


206600 


206581 


429 


240O17_region_G3_221223_13_Forward_Primer 


221134 


221158 


430 


240O17_region_G3_221223_13_Reverse_Primer 


221278 


221254 


431 


240017_region_G3_169084_14_Forward_Primer 


169051 


169075 


432 


2400 1 7_region_G3_169084_ 1 4_Reverse_Primer 


169196 


169173 


433 


240017_region_G3_94891_14_Forward_Primer 


94784 


94808 


434 


240O17_region_G3_94891_14_Reverse_Primer 


94929 


94905 


435 


240017_region_G3_7439_12_Forward_Primer 


7397 


7421 


436 


240O17_region_G3_7439_12_Reverse_Primer 


7542 


7518 


437 


2400 17_region_G3_28 1 852_6 l_Forward_Primer 


281797 


281821 


438 


240O17_region_G3_28 1 852_6 l_Reverse_Primer 


281943 


281919 


439 


240O17_region_G3_46583_12_Forward_Primer 


46554 


46578 


440 


240O17_region_G3_46583_12_Reverse_Primer 


46700 


46676 


441 


240O17_region_G3_306835_13_Forward_Primer 


306727 


306751 


442 


240O17_region_G3_306835_13_Reverse_Primer 


306874 


306849 
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443 


2400 17_region_G3_8547 1 _ 1 2__Forward_Primer 


85359 


85383 


444 


240O17_region_G3_85471_12_Reverse_Primer 


85507 


85483 


445 


240O17_region_G3_257208_12_Forward_Primer 


257129 


257153 


446 


240O17_region_G3_257208_12_Reverse_Primer 


257278 


257254 


447 


240O17_region_G3_150390_17_Forward_Primer 


150327 


150351 


448 


240O17_region_G3_150390_17_Reverse_Primer 


150476 


150452 


449 


240O17_region_G3_34697_75_Forward_Primer 


34662 


34685 


450 


240O17_region_G3_34697_75_Reverse_Primer 


34811 


34787 


451 


240O17_region_G3_150374_13_Forward_Primer 


150327 


150351 


452 


240O17_region_G3_150374_13_Reverse_Primer 


150476 


150452 


453 


240O17_region_G3_40513_22_Forward_Primer 


40422 


40446 


454 


240O17_region_G3_40513_22_Reverse_Primer 


40572 


40548 


455 


240O17_region_G3_268602_14_Forward_Primer 


268555 


268579 


456 


240O17_region_G3_268602_14_Reverse_Primer 


268705 


268681 


457 


240O17_region_G3_25357_13_Forward_Primer 


25271 


25295 


458 


240O17_region_G3_25357_13_Reverse_Primer 


25422 


25402 


459 


240O17_region_G3_137548_13_Forward_Primer 


139088 


139111 


459 


240O17_region_G3_137548_13_Forward_Primer 


137505 


137528 


460 


240O17_region_G3_137548_13_Reverse_Primer 


139239 


139215 


460 


240O17_region_G3_137548_13_Reverse_Primer 


137656 


137632 


461 


2400 17_region_G3_l 39 13 1_1 3_Forward_Primer 


139088 


139111 


462 


240O17_region_G3_139131_13_Reverse_Primer 


139239 


139215 


463 


2400 1 7_region_G3_203 855_1 2_Forward_Primer 


203749 


203773 


464 


240O17_region_G3_203855„12„Reverse_Primer 


203901 


203877 


465 


240O17_region_G3_199049_15_Forward„Primer 


199008 


199033 


466 


240O17_region_G3_199049_15_Reverse_Primer 


199160 


199136 


467 


240O17_region_G3_320907_12_Forward_Primer 


320885 


320906 


468 


240O17_region_G3_320907_12_Reverse_Primer 


321038 


321015 


469 


240O17_region_G3_16407_17_Forward_Primer 


16330 


16354 


470 


240O17_region_G3_16407_17_Reverse_Primer 


16483 


16459 


471 


2400 1 7_region_G3_2065 1 6_1 7_For ward_Primer 


206482 


206506 


472 


240O17_region_G3_206516_17_Reverse_Primer 


206635 


206616 


473 


240O17_region_G3_264495_13_Forward_Primer 


264423 


264447 


474 


240O17_region_G3_264495_13_Reverse_Primer 


264577 


264553 


475 


240017 region G3 156785 13 Forward_Primer 


156713 


156737 


476 


240O17_region_G3_156785_13_Reverse_Primer 


156868 


156844 


477 


240O17_region_G3_187129_12_Forward_Primer 


187068 


187092 


478 


240O17_region_G3_187129_12_Reverse_Primer 


187223 


187199 


479 


240O17_region_G3_214106_13_Forward_Primer 


214042 


214067 


480 


240O17_region_G3_214106_13_Reverse_Primer 


214197 


214173 


481 


240O17_region_G3_149013_12_Forward_Primer 


148898 


148922 


482 


240O17_region_G3_149013_12_Reverse_Primer 


149053 


149027 


483 


2400 17_region_G3_326352_l 6_Forward_Primer 


326311 


326335 


484 


240O17_region_G3_326352_16_Reverse_Primer 


326467 


326443 


485 


2400 17_region_G3_27 8962_ 1 2_For ward_Primer 


278933 


278957 


486 


240O17_region_G3_278962_12_Reverse_Primer 


279089 


279065 


487 


240O17_region_G3_256930_13_Forward_Primer 


256850 


256874 


488 


240O17_region_G3_256930_13_Reverse_Primer 


257006 


256982 


489 


240O17_region_G3_29646_14_Forward_Primer 


29589 


29613 


490 


2400 1 7_region_G3_29646_14_Reverse_Primer 


29746 


29721 


491 


2400 17_region_G3_296 1 8_13_Forward_Primer 


29589 


29613 
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492 


2400 17_region_G3_296 1 8_ 1 3_Reverse_Primer 


29746 


29721 


493 


240O17_region_G3_108561_14_Forward_Primer 


108518 


108542 


494 


240O17_region_G3_108561_14_Reverse_Primer 


108675 


108651 


495 


240O17_region_G3_143975_14_Forward_Primer 


143939 


143964 


496 


240O17_region_G3_143975_14_Reverse_Primer 


144096 


144072 


497 


240O17_region_G3_108431_20_Forward_Primer 


108362 


108386 


498 


2400 17__region_G3_10843 l_20_Reverse_Primer 


108520 


108497 


499 


240O17_region_G3_28 1764_1 l_Forward_Primer 


281645 


281669 


500 


2400 17_region_G3_28 1764_1 l_Reverse_Primer 


281803 


281779 


501 


2400 1 7 _region_G3_l 3005 8_1 5_Forward_Primer 


129994 


130018 


502 


240O17_region_G3_130058_15_Reverse_Primer 


130153 


130129 


503 


240O17_region_G3_310590_52_Forward_Primer 


310533 


310557 


504 


2400 17_region_G3_3 10590_52_Reverse_Primer 


310692 


310668 


505 


2400 17_region_G3_3 13405_1 4_Forward_Primer 


313345 


313369 


506 


240O17_region_G3_313405_14_Reverse_Primer 


313505 


313481 


507 


240O17_region_G3_302190_13_Forward_Primer 


302093 


302119 


508 


240O17_region_G3_302190_13_Reverse_Primer 


302253 


302229 


509 


240O17_region_G3_225343_17_Forward_Primer 


225315 


225338 


510 


240O17_region_G3_225343_17_Reverse_Primer 


225475 


225451 


511 


240O17_region_G3_208823_14_Forward_Primer 


208760 


208784 


512 


240O17_region_G3_208823_14_Reverse_Primer 


208921 


208897 


513 


2400 17_region_G3_74285_l l_Forward_Primer 


74220 


74244 


514 


240017 region_G3_74285_l l_ReverseJ>rimer 


74382 


74358 


515 


2400 1 7_region_G3_109052_l 6_Forward_Primer 


108999 


109023 


516 


240O17_region_G3_109052_16_Reverse_Primer 


109161 


109137 


517 


240O17_region_G3_6395_12_Forward_Primer 


6285 


6309 


518 


240O17_region_G3_6395_12_Reverse_Primer 


6447 


6423 


519 


240O17_region_G3_244905_16_Forward_Primer 


244865 


244890 


520 


2400 1 7_region J33_244905_l 6 JReverse JPrimer 


245028 


245004 


521 


240O17_region_G3_244956_13_Forward_Primer 


244865 


244890 


522 


240O17_region_G3_244956_13_Reverse_Primer 


245028 


245004 


523 


240O17_region_G3_117220_13_Forward_Primer 


117175 


117199 


524 


240O17_region_G3_117220_13_Reverse_Primer 


117339 


117315 


525 


240O17_region_G3_134707_14_Forward_Primer 


134584 


134608 


526 


2400 1 7 region_G3_l 34707_1 4_Reverse_Primer 


134749 


134725 


527 


2400 1 7 region_G3_35078_l 3_Forward_Primer 


34990 


35013 


528 


2400 17_region_G3_35078_l 3_Reverse_Primer 


35157 


35133 


529 


240O17_region_G3_210506_16_Forward_Primer 


210477 


210501 


530 


2400 1 7_region_G3_2 1 0506_1 6_Re verse_Primer 


210644 


210620 


531 


240O17_region_G3_116961_26_Forward_Primer 


116885 


116909 


532 


240017 region_G3_l 16961_26_Reverse_Primer 


117053 


117029 


533 


2400 17_region_G3_5 1 073_ 1 3_For ward_Primer 


50979 


51003 


534 


240O17_region_G3_51073_13_Reverse_Primer 


51147 


51123 


535 


2400 1 7_region_G3_5529 1_ 1 5_For ward_Primer 


55164 


55188 


536 


240O17_region_G3_55291_15_Reverse_Primer 


55333 


55309 


537 


240O17_region_G3_229651_18_Forward_Primer 


229615 


229639 


538 


2400 17_region_G3_22965 1_1 8_Reverse_Primer 


229784 


229760 


539 


2400 1 7_region_G3_30330 8_1 9_For ward_Primer 


303284 


303307 


540 


240O17_region_G3_303308_19_Reverse_Primer 


303454 


303429 


541 


240O17_region_G3_168373_20_Forward_Primer 


168262 


168286 


542 


240O17_region_G3_168373_20_ReverseJ?rimer 


168432 


168408 



27 



wo 
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location of 
primer on 
contig start 


location of 
primer on 
contig end 


543 


240O17_region_G3_253333_17_Forward_Primer 


253257 


253281 


544 


240O17_region_G3_253333_17_Reverse_Primer 


253428 


253404 


545 


2400 1 7_region_G3_579 1_ 1 3_Forward_Primer 


5766 


5790 


546 


2400 1 7_region_G3_579 1_1 3_Re verseJPrimer 


5937 


5912 


547 


2400 17_region_G3_20684 1_19 JForwardJPrimer 


206821 


206840 


548 


240O17_region_G3_206841_19_Reverse_Primer 


206993 


206969 


549 


240O17_region_G3_202827_12_Forward_Primer 


202782 


202806 


550 


240O17_region_G3_202827_12_Reverse_Primer 


202956 


202932 


551 


240O17_region_G3_322656_13_Forward_Primer 


322572 


322598 


552 


240O17_region_G3_322656_13_Reverse_Primer 


322748 


322724 


553 


240O17_region_G3_l 1 1841_14_Forward_Primer 


111709 


111733 


554 


240O17_region_G3_l 1 1841_14_Reverse_Primer 


111886 


111861 


555 


240O17_region_G3_192719_13_Forward_Primer 


192589 


192613 


556 


240O17_region_G3_192719_13_Reverse_Primer 


192767 


192743 


557 


240O17_region_G3_195630_17_Forward_Primer 


195490 


195514 


558 


240O17_region_G3_195630_17_Reverse_Primer 


195672 


195648 


559 


240O17_region_G3_69999_13_Forward_Primer 


69858 


69881 


560 


240O17_region_G3_69999_13_Reverse_Primer 


70040 


70016 


561 


240O17_region_G3_l 1 176_13_Forward_Primer 


11060 


11084 


562 


240O17_region_G3_l 1 176_13_Reverse_Primer 


11243 


11219 


563 


240O17_region_G3_228643_13_Forward_Primer 


228529 


228553 


564 


2400 17_region_G3_228643_l 3_Reverse_Primer 


22S713 


228689 


565 


240017 region Jj3_88478_19JForwardJPrimer 


88378 


88402 


566 


240O17„region_G3_88478_19_Reverse_Primer 


88562 


88538 


567 


240O17_region_G3_108950_13_Forward_Primer 


108838 


108858 


568 


240O17_region_G3_108950_13_Reverse_Primer 


109023 


108998 


569 


2400 17_region_G3_ 12 1 054_14_Forward_Primer 


120911 


120935 


570 


240O17jregion_G3_121054_14JReverse_Primer 


121096 


121072 


571 


240O17_region_G3_188337_14_Forward_Primer 


188204 


188228 


572 


240O17_region_G3_188337_14_Reverse_Primer 


191544 


191520 


572 


240O17_region_G3_188337_14_Reverse_Primer 


188391 


188367 


573 


240O17_region_G3_255944_21_Forward_Primer 


255879 


255903 


574 


240O17_region_G3_255944_21_Reverse_Primer 


256068 


256044 


575 


240O17_region_G3_219518_14_Forward_Primer 


219420 


219444 


576 


240017 region G3_219518_14_Reverse_Primer 


219609 


219585 


577 


240017 region G3_235601_15_Forward_Primer 


235483 


235507 


578 


240O17_region_G3_235601_15_Reverse_Primer 


235673 


235649 


579 


240017_region_G3_301529_13_Forward_Primer 


301498 


301522 


580 


240O17_region_G3_301529_13_Reverse_Primer 


301689 


301665 


581 


240017 region G3 94795_14_Forward_Primer 


94735 


94756 


582 


240O17_region_G3_94795_14_Reverse_Primer 


94929 


94905 


583 


240O17_region_G3_46703_23_Forward_Primer 


46676 


46700 


584 


2400 1 7_region_G3_46703_23_Reverse_Primer 


46870 


46846 


585 


2400 17 region_G3_596 1 6_1 4_Forward_Primer 


59539 


59563 


586 


240O17_region_G3_59616_14_Reverse_Primer 


59738 


59714 


587 


240O17_region_G3_296933_15_Forward_Primer 


296908 


296932 


588 


240O17_region_G3_296933_15_Reverse_Primer 


297113 


297089 


589 


240O17_region_G3_192428_17_Forward_Primer 


192402 


192426 


590 


240O17_region_G3_192428_17_Reverse_Primer 


192613 


192589 


591 


240O17_region_G3_191490_14_Forward_Primer 


191332 


191356 


592 


240O17_region_G3_191490_14_Reverse_Primer 


191544 


191520 



28 



wo 



01/51627 



PCT7US01/00552 



Seq Num 


Seq ID 


location of 
primer on 
contig start 


location of 
primer on 
contig end 


593 


240O17_region_G3_201 1 15_1 l_Forward_Primer 


200994 


201018 


594 


240O17_region_G3_201 1 15_1 l_Reverse_Primer 


201214 


201189 


595 


2400 1 7_region_G3_728 82_1 5_For wardJPrimer 


72848 


72874 


596 


240O17_region_G3_72882_15JReverse_Primer 


73068 


73042 


597 


240O17_region_G3_69514_13_Forward_Primer 


69411 


69437 


598 


2400 17_region_G3_695 1 4_ 1 3_Reverse_Primer 


69632 


69608 


599 


240O17_region_G3_37699_47_Forward_Primer 


37601 


37625 


600 


240O17_region_G3_37699_47_Reverse_Primer 


37827 


37802 


601 


240O17_region_G3_11301_29_Forward_Primer 


11274 


11300 


602 


240O17_region_G3_11301_29_Reverse_Primer 


11501 


11477 


603 


2400 17_region_G3_14 1 875_12_Forward_Primer 


141729 


141750 


604 


240O17_region_G3_141875_12_Reverse_Primer 


141964 


141939 


605 


2400 17_region_G3_98090_l 8_Forward_Primer 


98037 


98062 


606 


240O17_region_G3_98090_18_Reverse_Primer 


98274 


98250 


607 


240O17_region_G3_43298_35_Forward_Primer 


43144 


43168 


608 


240O17_region_G3_43298_35_Reverse_Primer 


43387 


43363 


609 


2400 17_region_G3_262094_l l_Forward_Primer 


261989 


262014 


610 


240O17_region_G3_262094_ll_Reverse_Primer 


262236 


262211 


611 


240O17_region_G3_262079_15_Forward_Primer 


261989 


262014 


612 


240O17_region_G3 262079_15_Reverse_Primer 


262236 


262211 


613 


240O17_region_G3_59090_12_Forward_Primer 


58986 


59012 


614 


2400 17 region_G3_59090_12_Reverse_Primer 


59248 


59224 


615 


240O17_region_G3_245723_13_Forward_Primer 


245502 


245526 


616 


2400 17_region_G3_245723_l 3_Reverse_Primer 


245766 


245742 


617 


240017 region G3_194628_54_Forward_Primer 


194581 


194607 


618 


2400 1 7_region_G3_ 1 9462 S_54_Re verse_Primer 


194846 


194822 


619 


2400 1 7_region_G3_45 66_ 1 6 _For ward_Primer 


4455 


4479 


620 


240O17_region_G3_4566_16_Reverse_Primer 


4722 


4696 


621 


240017 region G3_96209_14_Forward_Primer 


96119 


96143 


622 


240O17_region_G3_96209_14_Reverse_Primer 


96392 


96368 


623 


2400 17_region_G3_2487 15_17_Forward_Primer 


248633 


248657 


624 


2400 1 7_region_G3_24 87 1 5_ 1 7_Re verse_Primer 


248906 


248882 


625 


2400 17_region_G3_7 14 10_40_Forward_Primer 


71357 


71379 


626 


240O17_region_G3_71410_40_Reverse_Primer 


71636 


71611 


627 


240O17_region_G3_226519_13_Forward_Primer 


226315 


226339 


628 


2400 1 7_region_G3__2265 1 9_1 3_Re verse_Primer 


226598 


226574 


629 


240O17_region_G3_11282_19_Forward_Primer 


11217 


11242 


630 


240O17_region_G3_11282_19_Reverse_Primer 


11501 


11477 


631 


240O17_region_G3_170504_12_Forward_Primer 


170409 


170433 


632 


240017 region G3_170504_12_Reverse_Primer 


170694 


170671 


633 


240017 region G3_40864_14_Forward_Primer 


40652 


40678 


634 


240O17_region_G3_40864_14_Reverse_Primer 


40938 


40912 


635 


240O17_region_G3_13529_14_Forward_Primer 


13332 


13356 


636 


2400 1 7_region_G3_l 3529_ 1 4_Re verse_Primer 


13622 


13598 


637 


240O17_region_G3_22858_14_Forward_Primer 


22675 


22699 


638 


240O17_region_G3_22858_14_Reverse_Primer 


22966 


22942 


639 


2400 17_region_G3_3092 1 1_1 3_Forward_Primer 


309092 


309118 


640 


2400 17_region_G3_3092 1 1_1 3_Reverse_Primer 


309383 


309358 


641 


240O17_region_G3_55568_26_Forward_Primer 


55375 


55399 


642 


240O17_region_G3_55568_26_Reverse_Primer 


55667 


55642 


643 


240O17_region_G3_73238_16_Forward_Primer 


73043 


73069 



29 
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location of 
primer on 
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644 


240O17_region_G3_73238_16_Reverse_Primer 


73342 


73318 


645 


240O17_region_G3_52488_19_Forward_Primer 


52413 


52437 


646 


240O17_region_G3__52488_19_Reverse_Primer 


52712 


52688 


647 


3 1 801 3_region_A3_47 15 1 8_14 JForwardJPrimerjSeq 


471464 


471488 


648 


31801 3_region_A3_47 15 1 8_1 4_Reverse_Primer_Seq 


471567 


471541 


649 


3 1 80 13_region_A3_23 1599_23_Forward_Primer_Seq 


231568 


231592 


650 


318013_region_A3_231599_23_Reverse_Primer_Seq 


231672 


231651 


651 


31801 3_region_A3_3759 12_13_Forward_Primer_Seq 


375845 


375865 


652 


318013_region_A3_375912_13_Reverse_Primer_Seq 


375954 


375932 


653 


318O13_region_A3_180013_12_Forward_Primer_Seq 


179951 


179974 


654 


318O13_region_A3_180013_12_Reverse_Primer_Seq 


180060 


180038 


655 


31801 3_region_A3_17 1 606_ 1 4_For ward_Primer_Seq 


171545 


171569 


656 


318O13_region_A3_171606_14_Reverse_Primer_Seq 


171657 


171633 


657 


3 1 8013 _region_A3_4 16256_1 3 forward JPrimerJSeq 


416180 


416203 


658 


318013_region_A3_416256_13_Reverse_Primer_Seq 


416293 


416269 


659 


3 18013_region_A3_23 1395_15JForwardJPrimer_Seq 


231339 


231363 


660 


31801 3_region_A3_23 1395_15_Reverse_Primer_Seq 


231461 


231438 


661 


3 1 8O13_region_A3_5502_47Forward_Primer_Seq 


5461 


5485 


662 


3 1 80 1 3_region_A3 5502_47_Reverse_Primer_Seq 


5585 


5561 


663 


318O13_region_A3_93061_14JForwardJPrimer_Seq 


92988 


93012 


664 


3 1 80 1 3_region_A3_93061_14_Reverse_Primer_Seq 


93112 


93090 


665 


3 18013_region_A3_l 1 1684_19_Forward_Primer_Seq 


111646 


111670 


666 


318013_region_A3_l 1 1684_19_Reverse_Primer_Seq 


111772 


111748 


667 


318013 region A3_69328_14__Forward_Primer_Seq 


69246 


69269 


668 


318013 region_A3_69328_14_Reverse_Primer_Seq 


69373 


69349 


669 


3 1 80 1 3_region_A3_36529_17_Forward_Primer_Seq 


36488 


36512 


670 


318013_region_A3_36529_17JleverseJPrimer_Seq 


36617 


36593 


671 


3 1 80 13 region_A3_l 39 128_12 JForward JPrimerjSeq 


139043 


139067 


672 


318013_region_A3_139128_12_Reverse_Primer_Seq 


139174 


139150 


673 


318013_region_A3_495674_13_Forward_Primer_Seq 


495592 


495616 


674 


318013 region A3_495674_13_Reverse_Primer_Seq 


495723 


495699 


675 


3 1 80 1 3_region_A3_l 87577_1 3_Forward_Primer_Seq 


187482 


187506 


676 


31801 3_region_A3_l 87577_13_Reverse_Primer__Seq 


187613 


187590 


677 


31801 3_region_A3_453036_14_Forward_Primer_Seq 


452999 


453023 


678 


3 18013 region_A3_453036_14_Reverse_Primer_Seq 


453132 


453108 


679 


318013 region_A3_374041_l 3_Forward_Primer_Seq 


373964 


373988 


680 


318013 region A3 374041 13 Reverse_Primer_Seq 


374097 


374073 


681 


3 18013_region_A3_3412_l l_Forward_Primer_Seq 


3319 


3341 


682 


318013_region_A3_3412_ll_Reverse_Primer_Seq 


3454 


3430 


683 


318013_region_A3_276495_28_Forward_Primer_Seq 


276462 


276485 


684 


318013_region_A3_276495_28_Reverse_Primer_Seq 


276598 


276574 


685 


3 1 8013 region_A3_15 1 839_17_Forward_Primer_Seq 


151744 


151768 


686 


31801 3_region_A3_15 1 839_17_Reverse_Primer_Seq 


151882 


151858 


687 


3 1 80 13_region_A3_2929 12_12_For ward_Primer_Seq 


292875 


292899 


688 


318013_region_A3_292912_12_Reverse_Primer_Seq 


293014 


292990 


689 


318O13_region_A3_104560_12_Forward_Primer_Seq 


104464 


104488 


690 


318013 region_A3_104560_12_Reverse_Primer_Seq 


104604 


104580 


691 


318013 region_A3_65193_l l_ForwardJPrimer_Seq 


65155 


65179 


692 


31801 3_region_A3_65 193_1 l_Reverse_Primer_Seq 


65295 


65271 


693 


318O13_region_A3_110573_70_Forward_Primer_Seq 


110533 


110559 


694 


318O13_region_A3_110573_70_Reverse_Primer_Seq 


110674 


110648 
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695 


3 1 80 1 3_region_A3_65 1 17_12_Forward_Primer_Seq 


65034 


65058 


696 


3 1 80 13_region_A3_65 1 17_12_Reverse_Primer_Seq 


65177 


65153 


697 


318O13_region_A3_490837_16_Forward_Primer_Seq 


490762 


490786 


698 


318O13_region_A3_490837_16_Reverse_Primer_Seq 


490905 


490881 


699 


3 1 80 1 3_region_A3_107448_l l_Forward_Primer_Seq 


107385 


107411 


700 


3 1 SO 1 3_region_A3_107448_l l_Reverse_Primer_Seq 


107529 


107505 


701 


3 1 8013_region_A3_33 l_23_Forward_Primer_Seq 


276 


301 


702 


3 1 80 1 3_regionA3_33 l_23_Reverse_Primer_Seq 


421 


397 


703 


318O13_region_A3_193470_13_Forward_Primer_Seq 


193444 


193468 


704 


3 1 801 3_region_A3__l 93470_1 3_Reverse_Primer_Seq 


193589 


193565 


705 


3 1 80 1 3_region_A3__l 83305_14_Forward_Primer_Seq 


183239 


183263 


706 


3 1 80 1 3_region_A3_l 83305_14_Reverse_Primer_Seq 


183384 


183360 


707 


318013_region_A3_55050_14_Forward_Primer_Seq 


54998 


55022 


708 


3 1 80 1 3_region_A3_55050_14_Reverse_Primer_Seq 


55144 


55120 


709 


3 1 80 13_region_A3_224693_2 l_Forward_Primer_Seq 


224656 


224682 


710 


318013_region_A3_224693_21_Reverse_Primer_Seq 


224803 


224779 


711 


3 1 80 1 3_region_ A3_2072 1 6_1 2 JForward_Primer_Seq 


207152 


207176 


712 


3 1 80 1 3_region_A3_2072 16_12_Reverse_Primer_Seq 


207299 


207276 


713 


3 1 SO 1 3_region_A3_4654_22_Forward_Primer_Seq 


4612 


4636 


714 


3 1 80 13_region_A3_4654_22_Reverse_Primer_Seq 


4760 


4736 


715 


3 1 80 13_region_A3_408959_l 3_Forward_Primer_Seq 


408918 


408942 


716 


318O13_region_A3_408959_13_Reverse_Primer_Seq 


409066 


409042 


717 


318013_region_A3_132288_22_Forward_Primer_Seq 


132192 


132216 


718 


318013_region_A3_132288„22_Reverse_Primer„Seq 


132340 


132316 


719 


3 1 SO 1 3_region_A3_292822_20_Forward_Priraer_Seq 


292747 


292771 


720 


318O13_region_A3_292822_20_Reverse_Primer_Seq 


292895 


292871 


721 


3 18013 _region_A3_3 1 1076_12_Forward_Primer_Seq 


311027 


311051 


722 


3 1 8013_region_A3_3 1 1076_12_ReverseJ>rimer_Seq 


311175 


311152 


723 


3 1 80 1 3_region_A3_509623_l 3_Forward_Primer_Seq 


509584 


509608 


724 


3 1 80 1 3_region_A3_509623_ 1 3_Reverse_Primer_Seq 


509732 


509708 


725 


3 1 80 1 3_region_A3_190404_14_Forward_Primer_Seq 


190358 


190382 


726 


318O13_region_A3_190404_14_Reverse_Primer_Seq 


190506 


190482 


727 


3 1 80 1 3_region_A3_l 649 1 6_1 5_Forward_Primer_Seq 


164808 


164832 


728 


3 1 80 1 3_region_A3_1649 16_15_Reverse_Primer_Seq_ 


164957 


164933 


729 


318013 region_A3_21028_13JForwardJPrimer_Seq 


21001 


21026 


730 


318013 region A3 21028_13_Reverse_Primer_Seq 


21150 


21126 


731 


318013 region A3_208012_17_Forward_Primer_Seq 


207955 


207979 


732 


3 1 80 1 3_region_A3_208012_17_Reverse_Primer_Seq 


208104 


208085 


733 


318O13_region_A3_484089_14_Forward_Primer_Seq 


484036 


484060 


734 


3 1 80 1 3_region_A3_484089_14_Reverse_Primer_Seq 


484185 


484161 


735 


318O13_region_A3_332780_17_Forward_Primer_Seq 


332723 


332747 


736 


3 1 80 1 3_region_A3_332780_17_Reverse_Primer_Seq 


332872 


332853 


737 


318O13_region_A3_480137_37_Forward_Primer_Seq 


480059 


480084 


738 


3 1 80 1 3_region_A3_480 1 37_37_Reverse_Primer_Seq 


480208 


480182 


739 


31801 3_region_A3_44 1 05 6_1 4_For ward_Primer_Seq 


441011 


441035 


740 


31801 3_region_A3_441056_14_Reverse_Primer_Seq 


441161 


441138 


741 


3 18013_region_A3_77486_l l_Forward_Primer_Seq 


77447 


77471 


742 


3 1 80 1 3_region_A3_77486_l l_Reverse_Primer_Seq 


77597 


77573 


743 


3 18013_region_A3_272468_l l_Forward_Primer_Seq 


272423 


272447 


744 


318013_region_A3_272468_ll_Reverse_Primer_Seq 


272573 


272549 


745 


31801 3_region_A3_425 3 1 9_1 7_For ward_Primer_Seq 


425233 


425257 
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746 


3 1 80 13_region_A3_4253 1 9_17_Reverse_Primer_Seq 


425383 


425359 


747 


318013_region_A3_413879_31_Forward_Primer_Seq 


413835 


413859 


748 


318013_region_A3_413879_31_Reverse_Primer_Seq 


413985 


413961 


749 


318O13_region_A3_S0477_64_Forward_Primer_Seq 


80440 


80464 


750 


3 1 80 1 3_region_A3_80477_64 JReverse JPrimer_Seq 


80591 


80567 


751 


31801 3_region_A3_277272_50_Forward_Primer_Seq 


277213 


277237 


752 


318O13_region_A3_277272_50_Reverse_Primer_Seq 


277364 


277340 


753 


31801 3_region_A3_509642_l 3_Forward_Primer_Seq 


509604 


509628 


754 


318O13_region_A3_509642_13_Reverse_Primer_Seq 


509755 


509731 


755 


318013_region_A3_321771_14_Forward_Primer_Seq 


321663 


321687 


756 


3 1 80 1 3_region_A3_32 1 77 1_1 4_Reverse_Primer_Seq 


321815 


321791 


757 


318013_region_A3_26788_12_Forward_Primer_Seq 


26734 


26758 


758 


3 1 80 1 3_region_A3_26788_ 1 2_Reverse_Primer_Seq 


26886 


26862 


759 


31801 3_region_A3_262706_l 6_Forward_Primer_Seq 


262649 


262673 


760 


318O13_region_A3_262706_16_Reverse_Primer_Seq 


262802 


262778 


761 


318013_region_A3_243928_16_Forward_Primer_Seq 


243891 


243915 


762 


318013_region_A3_243928_16_Reverse_Primer_Seq 


244044 


244020 


763 


3 1 80 1 3_region_A3_23246_148_Forward_Primer_Seq 


23215 


23239 


764 


3 1 80 13_region_A3_23246_148_Reverse_Primer_Seq 


23368 


23344 


765 


3 1 80 13_region_A3_l 65406_12_Forward_Primer_Seq 


165367 


165391 


766 


318O13_region_A3_165406_12_Reverse_Primer_Seq 


165521 


165497 


767 


3 1 80 1 3_region_A3_486294_14 JForward_Primer_Seq 


486208 


486232 


768 


3 1 80 1 3_region_A3_486294_14_Reverse_Primer_Seq 


486362 


486338 


769 


3 1 80 1 3_region_A3_46754_12_Forward_Primer_Seq 


46661 


46685 


770 


318013_region_A3_46754_12__Reverse„Primer_Seq 


46816 


46792 


771 


3 18013_region_A3_38 1 1 16_15_Forward_Primer_Seq 


381080 


381104 


772 


31801 3_region_A3„3 8111 6_1 5_Reverse_Primer_Seq 


381235 


381211 


773 


318O13_region_A3_350369_ll_Forward_Primer_Seq 


350295 


350319 


774 


3 1 80 1 3_region_A3_350369_l l_Reverse_Primer_Seq 


350450 


350426 


775 


3 1 80 1 3_region_A3__13884 1_1 3_Forward_Primer_Seq 


138795 


138819 


776 


318013_region_A3_138841_13_Reverse_Primer_Seq 


138950 


138926 


777 


318013_region_A3_12158_142_Forward_Primer_Seq 


12117 


12141 


778 


318013_region_A3_12158_142_Reverse_Primer_Seq 


12272 


12248 


779 


318013_region_A3_315368_13_Forward_Primer_Seq 


315310 


315334 


780 


318013 region A3 315368 13_Reverse Primer_Seq 


315465 


315441 


781 


3 1 SO 13_region_A3_307549_l 3_Forward_Primer_Seq 


307464 


307488 


782 


3 1 80 1 3 region_A3_307549_l 3_Reverse_Primer_Seq 


307619 


307595 


783 


31801 3_region_A3_159857_14_Forward_Primer_Seq 


159772 


159796 


784 


318013_region_A3_159857_14_Reverse_Primer_Seq 


159928 


159904 


785 


3 1 80 13_region_A3_14055 l_15_Forward_Primer_Seq 


140454 


140478 


786 


318O13_region_A3_140551_15_Reverse_Primer_Seq 


140610 


140586 


787 


3 1 80 1 3_region_A3_279869_l l_Forward_Primer_Seq 


279797 


279821 


788 


31801 3_region_A3_279869_l l_Reverse_Primer_Seq 


279953 


279929 


789 


318013_region_A3_78292_35_Forward_Primer_Seq 


78265 


78291 


790 


318013_region_A3_78292_35_Reverse_Primer_Seq 


78422 


78397 


791 


3 1 80 13_region_A3_l 85019_12_Forward_Primer_Seq 


184953 


184977 


792 


318013 region A3_185019_12_Reverse_Primer_Seq 


185111 


185087 


793 


31801 3_region_A3_409 1 64_ 1 3_For ward_Primer_Seq 


409082 


409106 


794 


31801 3_region_A3_409 1 64_ 1 3_Reverse_Primer_Seq 


409240 


409219 


795 


318013_region_A3_75392_14_Forward_Primer_Seq 


75287 


75311 


796 


318013_region_A3_75392_14_Reverse_Primer_Seq 


75445 


75421 
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797 


3 1 801 3_region_A3 23 1 320_1 2_Forward_Primer_Seq 


231269 


231293 


798 


318O13_region_A3_231320_12_Reverse_Primer_Seq 


231429 


231405 


799 


318O13_region_A3_381102_14_ForwardJPrimer_Seq 


381041 


381064 


800 


318O13_region_A3_381102_14_Reverse_Primer_Seq 


381201 


381176 


801 


3 1 8013_region_A3_49 1 826_15_Forward_Primer_Seq 


491753 


491777 


802 


31801 3_region_A3_49 1 826_1 5_Reverse_Primer_Seq 


491914 


491891 


803 


318013_region_A3_56365_21_Forward_Primer_Seq 


56336 


56360 


804 


3 1 801 3_region_A3 56365_2 l_Reverse_Primer_Seq 


56497 


56473 


805 


3 1 8013_region_A3_372628_15_Forward_Primer_Seq 


372554 


372578 


806 


318013_region_A3_372628_15_Reverse_Primer_Seq 


372715 


372691 


807 


318O13_region_A3_217037_ll_Forward_Primer_Seq 


216919 


216943 


808 


3 18O13_region_A3_217037_l l_Reverse_Primer_Seq 


217081 


217057 


809 


318O13_region_A3_302609_ll_Forward_Primer_Seq 


302575 


302599 


810 


318O13_region_A3_302609_ll_Reverse_Primer_Seq 


302737 


302713 


811 


3 1 8013_region_A3_341 804_1 l_Forward_Primer_Seq 


341686 


341710 


812 


3 1 80 13_region_A3_34 1 804_1 l_Reverse_Primer_Seq 


341848 


341824 


807 


3 18O13_region_A3_217037_l l_Forward_Primer_Seq 


216919 


216943 


808 


318O13_region_A3_217037_ll_Reverse_Primer_Seq 


217081 


217057 


813 


3 1 80 1 3_region_A3_264929_68_Forward_Primer_Seq 


264862 


264886 


814 


318013_region_A3_264929_68_Reverse_Primer_Seq 


265024 


265000 


815 


318013_region_A3_55499_12_Forward_Primer_Seq 


55400 


55424 


816 


318013_region_A3_55499_12_Reverse_Primer_Seq 


55563 


55539 


817 


318013_region_A3_295634_14_Forward_Primer_Seq 


295538 


295562 


818 


318013_region_A3_295634_14_Reverse_Primer_Seq 


295702 


295677 


819 


3 1 80 13_region_A3_26935 8_1 5JForwardJPrimer_Seq 


269242 


269266 


820 


318013jregion_A3_269358_15JReversejPrimer_Seq 


269406 


269382 


821 


318O13_region_A3_457009_24JForwardJPrimer_Seq 


456924 


456948 


822 


3 1 80 1 3_region_A3_457009_24_Reverse_Primer_Seq 


457088 


457064 


823 


318013_region_A3_176598_14_Forward_Primer_Seq 


176554 


176578 


824 


318013_region_A3_176598_14_Reverse_Primer_Seq 


176718 


176694 


825 


31801 3_region_A3_278266_12_Forward_Primer_Seq 


278210 


278234 


826 


318013_region_A3_278266_12_Reverse_Primer_Seq 


278376 


278350 


827 


31801 3_region_A3_39 1 8 10_12_Forward_Primer_Seq 


391683 


391707 


828 


3 18013 region_A3_391810_12_Reverse_Primer_Seq 


391851 


391826 


829 


318013 region A3_269485_15_ForwardJPrirner_Seq 


269383 


269407 


830 


318013 region A3_269485_15_Reverse_Primer_Seq 


269551 


269527 


831 


318013 region A3_359247_17_Forward_Primer_Seq 


359218 


359243 


832 


31801 3_region_A3_359247_l 7_Reverse_Primer_Seq 


359386 


359362 


833 


3 1 801 3_region_A3_3 15094_1 3_Forward_Primer_Seq 


314976 


315002 


834 


3 1 80 1 3 region. A3_3 1 5094_ 1 3_Re verse_Primer_Seq 


315145 


315120 


835 


318013 region A3_307823_13_Forward_Primer_Seq 


307784 


307809 


836 


31801 3_region_A3_307823_l 3_Reverse_Primer_Seq 


307953 


307927 


837 


318013_region_A3_248588_15_Forward_Primer_Seq 


248540 


248564 


838 


318013_region_A3_248588_15_Reverse_Primer_Seq 


248709 


248684 


839 


31801 3_region_A3_252426_85_Forward_Primer_Seq 


252398 


252423 


840 


318013_region_A3_252426_85_Reverse_Primer_Seq 


252568 


252543 


841 


31801 3_region_A3_5 133 14_1 6_Forward_Primer_Seq 


513209 


513233 


842 


3 1 80 1 3_region_A3_5 133 14_1 6_Reverse_Primer_SecL 


513379 


513355 


843 


3 1 801 3_region_A3_68 1 83_14_Forward_Primer_Seq 


68108 


68132 


844 


3 1 80 13_region_A3_68 1 83_14_Reverse_Primer_Seq 


68279 


68255 


845 


3 18013_region_A3_47 1 19 l_13_Forward_Primer_Seq 


471059 


471083 
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846 


318013_region_A3_471191_13_Reverse_Primer_Seq 


471231 


471206 


847 


3 1 8013_region_A3__163547_l 8_Forward_Primer_Seq 


163459 


163483 


848 


3 1 8013_region_A3_163547_l 8_Reverse_Primer_Seq 


163632 


163608 


849 


318013_region_A3_417867_15_Forward_Primer_Seq 


417839 


417863 


850 


318013_region_A3_417S67_15_Reverse_Primer_Seq 


418014 


417990 


851 


318013_region_A3_332465_14JForwardJPrimer_Seq 


332346 


332370 


852 


3 1 8013_region_A3_332465_14_Reverse_Primer_Seq 


332523 


332499 


853 


318O13_region_A3_207697_14_Forward_Primer_Seq 


207578 


207602 


854 


318O13_region_A3_207697_14_Reverse_Primer_Seq 


207755 


207731 


855 


318013_region_A3_277229_43_Forward_Primer_Seq^ 


277186 


277210 


856 


318013_region_A3_277229_43_Reverse_Primer_Seq 


277364 


277340 


857 


318013_region_A3_36366_ll_Forward_Primer_Seq 


36323 


36345 


858 


318013_region_A3_36366_ll_ReverseJPrimer_Seq 


36501 


36477 


859 


318O13_region_A3_91970_12_Forward_Primer_Seq 


91938 


91962 


860 


318O13_region_A3_91970_12_Reverse_Primer_Seq 


92116 


92091 


861 


318013_region_A3_211533_ll_Forward_Primer_Seq 


211406 


211430 


862 


3 1 80 13_region_A3_21 1533_1 l_Reverse_Primer_Seq 


211585 


211561 


863 


318013 region A3_336301_ll_Forward_Primer_Seq 


336174 


336198 


864 


318O13_region_A3_336301_ll_Reverse_Primer_Seq 


336353 


336329 


865 


31801 3_region_A3_44 1 603_14_Forwai-d_Primer_Seq 


441539 


441563 


866 


3 1 80 1 3_region_A3_44 1 603_1 4_Reverse_Primer_Seq 


441718 


441694 


867 


318013_region_A3_468354_15_Forward_Primer_Seq 


468263 


468287 


868 


318013_region_A3_468354_15_Reverse_Primer_Seq 


468442 


468418 


869 


3 1 8013 _region_A3_l 88983_1 8_Forward_Primer_Seq 


188855 


188879 


870 


3 1 80 1 3_region_A3_188983_l 8_Reverse_Primer_Seq 


189035 


189009 


871 


318O13_region_A3_115502J7J?orwardJPrimer_Seq 


115469 


115493 


872 


3 1 SO 1 3_region_A3_l 1 5502_17_Reverse_Primer_Seq 


115649 


115625 


873 


318O13_region_A3_163006_13_Forward_Primer_Seq 


162972 


162996 


874 


3 1 801 3_region_A3_l 63006_1 3 Jteverse_Primer__Seq 


163153 


163129 


875 


3 1 8013_region_A3_l 19283_14_Forward_Primer_Seq 


119199 


119224 


876 


318013_region_A3_119283_14_Reverse_Primer_Seq 


119381 


119357 


877 


3 18013_region_A3_491 126_1 l_Forward_Primer_Seq 


491062 


491086 


878 


318013_region_A3_491126_ll_Reverse_Primer_Seq 


491244 


491220 


879 


318013_region_A3_99512_21_Forward_Primer_Seq 


99398 


99422 


880 


3 18013 region_A3_995 12_21 JleverseJPrimer_Seq 


99581 


99557 


881 


318013 region_A3_280291_17_Forward_Primer_Seq 


280201 


280226 


882 


31801 3_region_A3_28029 1_1 7_Reverse_Primer_Seq 


280385 


280361 


883 


318013_region_A3_138443_19_Forward_Primer_Seq 


138304 


138329 


884 


318013_region_A3_138443_19_Reverse_Primer_Seq 


138488 


138465 


885 


318013 region A3 115973 14 Forward_Primer Seq 


115832 


115856 


886 


3 18013_region_A3_l 15973_14_Reverse_Primer_Seq 


116016 


115992 


887 


31801 3_region_A3_329977_14_Forward_Primer_Seq 


329864 


329889 


888 


318013_region_A3_329977_14_Reverse_Primer_Seq 


330050 


330026 


889 


318O13_region_A3_205203_14_Forward_Primer_Seq 


205090 


205115 


890 


31801 3_region_A3_205203_ 1 4_Re verse_Primer_Seq 


205276 


205252 


891 


3 1 8013_region_A3_153 1 14_12_Forward_Primer_Seq 


152969 


152993 


892 


318013_region_A3_153114_12_Reverse_Primer_Seq 


153156 


153132 


893 


31801 3_region_A3_3458 1_1 3 JForward JPrimer__Seq_ 


34523 


34547 


894 


318013_region_A3_34581_13_Reverse_Primer_Seq 


34712 


34688 


895 


3 1 8013_region_A3_292577_l 9_Forward_Primer_Seq 


292549 


292573 


896 


3 1 80 13_region_A3_292577_l 9_Reverse_Primer_Seq 


292739 


292715 
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897 


318O13_region_A3_445391_20_Forward_Primer_Seq 


445356 


445382 


898 


318O13_region_A3_445391_20_Reverse_Primer_Seq 


445547 


445523 


899 


318O13_region_A3_350540_17_Forward_Primer_Seq 


350421 


350445 


900 


318O13_region_A3_350540_17_Reverse_Primer_Seq 


350612 


350588 


901 


3 1 80 1 3_region_A3_453879_15_Forwaxd_Primer_Seq 


453725 


453750 


902 


31801 3_region_A3_453 879_1 5_Reverse_Primer_Seq 


453918 


453894 


903 


3 1 801 3_region_A3_20 1246_13_Forward_Primer_Seq 


201128 


201153 


904 


31801 3_region_A3_20 1 246_ 1 3_Reverse_Primer_Seq 


201321 


201297 


905 


318O13jregion_A3_326020_13_ForwardJPrimer_Seq 


325902 


325927 


906 


3 1 80 13_region_A3_326020_l 3_Reverse_Primer_Seq 


326095 


326071 


907 


318O13_region_A3_503801_14_Forward_Primer_Seq 


503656 


503680 


908 


318O13_region_A3_503801_14_Reverse_Primer_Seq 


503849 


503823 


909 


318O13_region_A3_302400_52_Forward_Primer_Seq 


302283 


302307 


910 


318O13_region_A3_302400_52_Reverse_Primer_Seq 


302481 


302456 


911 


318013_region_A3_448857_15_Forward_Primer_Seq 


448748 


448772 


912 


3 1 80 1 3_region_A3_448857_l 5_Reverse_Primer_Seq 


448947 


448924 


913 


31801 3_region_A3_483 64_1 4_Forward_Primer_Seq 


48232 


48256 


914 


3 1 80 1 3_region_A3_48364_14_Reverse_Primer_Seg_ 


48435 


48412 


915 


3 1 80 13_region_A3_25 1 804_48_Forward_Primer_Seq 


251738 


251762 


916 


3 1 80 1 3_region_A3_25 1 804_48_Reverse_Primer_Seq_ 


251942 


251918 


917 


318013_region_A3_382583_13_Forward_Primer_Seq 


382549 


382574 


918 


318013_region_A3_382583_13_Reverse_Primer_Seq 


382753 


382728 


919 


318013_region_A3 124737_14_Forward_Primer_Seq 


124641 


124665 


920 


31801 3_region_A3 1 247 3 7_1 4_Reverse_Primer_Seq 


124846 


124822 


921 


318013_region_A3_124766_13_Forward_Primer_Seq 


124641 


124665 


922 


318013_region_A3_124766_13_Reverse_Primer_Seq 


124846 


124822 


923 


31801 3_region_A3_46 135 1_1 6_Forward_Primer_Seq 


461218 


461242 


924 


31801 3_region_A3_46 1 35 1_1 6_Reverse_Primer_Seq 


461426 


461402 


925 


31801 3_region_A3_64953_19_Forward_Primer_Seq 


64798 


64823 


926 


318013_region_A3_64953_19_Reverse_Primer_Seq 


65011 


64987 


927 


318013_region_A3_366586_13JForward_Primer_Seq 


366508 


366532 


928 


318013_region_A3_366586_13_Reverse_Primer_Seq 


366722 


366698 


929 


318O13_region_A3_46190_15_Forward_Primer_Seq 


46012 


46037 


930 


318O13_region_A3_46190_15_Reverse_Primer_Seq 


46228 


46205 


931 


31801 3_region_ A3_8 1 0 1 6_ 1 1 _For war d_Primer_Seq 


80927 


80952 


932 


31801 3_region_A3_8 1 0 1 6_1 1 _Reverse_Primer_Seq 


81146 


81122 


933 


318013 region A3 134426 14 Forward Primer_Seq 


134253 


134277 


934 


31801 3_region_A3_134426_14_Reverse_Primer_Seq 


134474 


134449 


935 


318013_region_A3_292724_14_Forward_Primer_Seq 


292549 


292573 


936 


318013_region_A3_292724_14_Reverse_Primer_Seq 


292771 


292747 


937 


3 1 8013_region_A3_l 87096_17_Forward_Primer_Seq 


187058 


187082 


938 


318013 region A3 187096_17 Reverse_Primer_Seq 


187282 


187257 


939 


318013 region A3 381693 13 Forward_Primer_Seq 


381658 


381683 


940 


318013_region_A3_381693_13_Reverse_Primer_Seq 


381885 


381863 


941 


31801 3_region_A3_3 6 1 286_33_Forward_Primer_Seq 


361173 


361197 


942 


31801 3_region_A3_3 6 1 28 6_3 3_Re verse_Primer_Seq 


361401 


361376 


943 


3 1 8013_region_A3_482668_14_Forward_Primer_Seq 


482592 


482616 


944 


318013_region_A3_482668_14_Reverse_Primer_Seq 


482821 


482796 


945 


318013 region_A3_128002_12_Forward_Primer_Seq 


127882 


127906 


946 


3 1 8O13_region_A3_128002_l 2_Reverse_Primer_Seq 


128112 


128087 


947 


318O13_region_A3_499270_14_Forward_Primer_Seq 


499184 


499208 
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contig end 


948 


318O13_region_A3_499270_14_Reverse_Primer_Seq 


499422 


499398 


949 


318O13_region_A3_231650_12_Forward_Primer_Seq 


231568 


231592 


950 


3 1 8013_region_A3_23 1650_12_Reverse_Primer_Seq 


231809 


231788 


951 


318013_region_A3_199851_13_Forward_Primer_Seq 


199762 


199786 


952 


3 1 80 13_region_A3__19985 1_1 3_Reverse_Primer_Seq 


200012 


199988 


953 


3 1 801 3_region_A3_324629_l 3_Forward_Primer_Seq 


324540 


324564 


954 


31801 3_region_A3_324629_ 1 3_Reverse_Primer_Seq 


324790 


324766 


955 


318O13_region_A3_374190_19_Forward_Primer_Seq_ 


374129 


374152 


956 


318O13_region_A3_374190_19_Reverse_Primer_Seq 


374394 


374370 


957 


3 1 80 1 3_region_A3_460603_13_Forward_Primer_Seq 


460450 


460474 


958 


3 1 80 13_region_A3_460603_l 3_Reverse_Primer_Seq 


460715 


460691 


959 


3 1 8O13_region_A3_10868 l_14_Forward_Primer_Seq 


108524 


108548 


960 


318O13_region_A3_108681_14_Reverse_Primer_Seq 


108791 


108768 


961 


3 1 80 13_region_A3_45979 l_47_Forward_Primer_Seq 


459639 


459663 


962 


318013_region_A3_459791_47_Reverse_Primer_Seq 


459907 


459883 


963 


318O13_region_A3_4257_20_Forward_Primer_Seq 


4172 


4196 


964 


31801 3_region_A3_4257_20_Reverse_Primer_Seq 


4450 


4425 


965 


31801 3_region_A3_23 8 8 10_1 4_Forward_Primer_Seq 


238563 


238589 


966 


31801 3_region_A3_23 8 8 1 0_1 4_Reverse_Primer_Seq 


238850 


238826 


967 


318013_region_A3_245817_14_Forward_Primer_Seq 


245713 


245738 


968 


31801 3_region_A3_245 8 1 7_14_Reverse_Primer_Seq 


246001 


245977 


969 


318013_region_A3_245956_14_Forward_Primer_Seq 


245713 


245738 


970 


318013_region_A3_245956_14_Reverse_Primer_Seq 


246001 


245977 


971 


318013_region_A3_74148_14_Forward_Primer_Seq 


74050 


74075 


972 


3 18013 region„A3_74148_14JReverseJPrimer_Seq 


74338 


74314 


973 


318O13_region_A3_74089_15_Forward_Primer_Seq 


74050 


74075 


974 


318O13_region_A3_74089_15_Reverse_Primer_Seq 


74338 


74314 


975 


31801 3_region_A3_241 686_12_Forward_Primer_Seq 


241470 


241494 


976 


3 1 80 1 3_region_A3_24 1686_12_Reverse_Primer_Seq 


241765 


241741 


977 


318013_region_A3_47476_12_Forward_Primer_Seq 


47280 


47304 


978 


3 1 80 1 3_region_A3_47476_127_Reverse_Primer_Seq 


47577 


47554 


979 


31801 3_region_A3_164550_12_Forward_Primer_Seq 


164323 


164347 


980 


31801 3_region_A3_l 64550_1 2_Reverse_Primer_Seq 


164621 


164598 


981 


31801 3_region_A3_10 1255_15_Forward_Primer_Seq 


101119 


101144 


982 


318O13_region_A3_101255_15_Reverse_Primer_Seq 


101418 


101392 


983 


515O02_region_G2_16189_ll_Forward_Primer 


16144 


16168 


984 


5 15O02 region_G2_16189_l l_Reverse_Primer 


16244 


16220 


985 


515O02_region_G2_71925_13_Forward_Primer 


71880 


71905 


986 


5 1 5O02_region_G2_7 1 925_ 1 3_Reverse_Primer 


71987 


71963 


987 


515O02_region_G2_4707_12_Forward_Primer 


4660 


4684 


988 


515002 region G2 4707_12Reverse_Primer 


4769 


4743 


989 


5 15O02_region_G2_l 1 8904_1 8_Forward_Primer 


118847 


118871 


990 


5 15O02_region_G2_l 18904_1 8_Reverse_Primer 


118957 


118932 


991 


515O02_region_G2_13655_17_Forward_Primer 


13567 


13592 


992 


515O02_region_G2_13655_17_Reverse_Primer 


13698 


13673 


993 


5 15O02_region_G2_53900_l 3_Forward_Primer 


53843 


53867 


994 


515O02_region_G2_53900_13_Reverse_Primer 


53985 


53961 


995 


515O02_region_G2_8079_14_Forward_Primer 


8023 


8047 


996 


515O02_region_G2_8079_14_Reverse_Primer 


8167 


8143 


997 


515O02_region_G2_9969_28_Forward_Primer 


9917 


9941 


998 


515O02_region_G2_9969_28_Reverse_Primer 


10062 


10038 
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contig end 


999 


515O02_region_G2_72308_77_Forward_Primer 


72272 


72298 


1000 


515O02_region_G2_72308_77_Reverse_Primer 


10062 


10038 


1001 


515O02_region_G2_99475_19_Forward_Primer 


99408 


99433 


1002 


515O02_region_G2_99475_19_Reverse_Primer 


99554 


99530 


1003 


5 15O02_region_G2_l 18615_18_Forward_Primer 


118512 


118535 


1004 


5 15O02_region_G2_l 1 86 15_1 8_Reverse_Primer 


118658 


118634 


1005 


515O02_region_G2_119001_46_Forward_Primer 


118931 


118956 


1006 


515O02_region_G2_119001_46_Reverse_Primer 


119079 


119055 


1007 


515002_region_G2_118958_43_Forward_Primer 


118931 


118956 


1008 


515O02_regioti_G2_118958_43_Reverse_Primer 


119079 


119055 


1009 


515O02_region_G2_17197_13_Forward_Primer 


17128 


17152 


1010 


515O02_region_G2_17197_13_Reverse_Primer 


17276 


17252 


1011 


5 15O02_region_G2_105 163_29_Forward_Primer 


105068 


105092 


1012 


515O02_region_G2_105163_29_Reverse_Primer 


105217 


105192 


1013 


5 1 5 O02_region_G2_l 1 1 335_ 1 3_For ward_Primer 


111308 


111332 


1014 


5 15O02_region_G2_l 1 1335_13_Reverse_Primer 


111458 


111434 


1015 


515O02_region_G2_106396_13_Forward_Primer 


106318 


106342 


1016 


515O02_region_G2_106396_13_Reverse_Primer 


106469 


106445 


1017 


515O02_region_G2_59229_17_Forward_Primer 


59203 


59227 


1018 


5 1 5 O02_region_G2_59229_l 7_Reverse_Primer 


59354 


59330 


1019 


515O02_region_G2_73795_20_Forward_Primer 


73769 


73793 


1020 


515O02_region_G2_73795_20_Reverse_Primer 


73921 


73896 


1021 


515O02_region_G2_85664_20_Forward_Primer 


85586 


85611 


1022 


515O02_region_G2_85664„20__Reverse_Primer 


85738 


85714 


1023 


515O02_region_G2_36921_17_Forward_Primer 


36830 


36854 


1024 


515O02_region_G2_36921_17_Reverse_Primer 


36983 


36959 


1025 


515O02_region_G2_124150_19_Forward_Primer 


124073 


124096 


1026 


515O02_region_G2_124150_19_Reverse_Primer 


124227 


124203 


1027 


515O02_region_G2_5089_14_Forward_Primer 


4999 


5024 


1028 


515O02_region_G2_5089_14_Reverse_Primer 


5156 


5132 


1029 


515O02_region_G2_58221_15_ForwardJPrimer 


58197 


58220 


1030 


515O02_region_G2_58221_15_Reverse_Primer 


58354 


58330 


1031 


515O02_region_G2_96139_14_Forward_Primer 


96022 


96046 


1032 


5 1 5 O02_region_G2_96 1 39_14_Reverse_Primer 


96182 


96158 


1033 


5 15O02_region_G2_70595_l 3_Forward_Primer 


70472 


70496 


1034 


5 1 5 O02_region_G2_70595_ 1 3_Reverse_Primer 


70634 


70608 


1035 


515002 region G2_4340_15_Forward_Primer 


4312 


4337 


1036 


515O02_region_G2_4340_15_Reverse_Primer 


4477 


4454 


1037 


5 15O02_region_G2_90417_l l_Forward_Primer 


90335 


90359 


1038 


5 15O02_region_G2_90417_l l_Reverse_Primer 


90503 


90479 


1039 


515O02_region_G2_49711_17_Forward_Primer 


49652 


49676 


1040 


515O02_region_G2_49711_17_Reverse_Primer 


49820 


49796 


1041 


5 15O02 region_G2_63053_13_Forward_Primer 


63005 


63029 


1042 


515O02_region_G2_63053_13_Reverse_Primer 


63173 


63148 


1043 


515O02_region_G2_63076_14_Forward_Primer 


63005 


63029 


1044 


515O02_region_G2_63076_14_Reverse_Primer 


63173 


63148 


1045 


5 1 5 O02_region_G2_44442_ 1 2_For ward_Primer 


44335 


44359 


1046 


515O02_region_G2_44442_12_Reverse_Primer 


44505 


44481 


1047 


515O02_region_G2_44422_19_Forward_Primer 


44335 


44359 


1048 


5 15O02 region_G2_44422_19_Reverse_Primer 


44505 


44481 


1049 


515O02_region_G2_44158_19_Forward_Primer 


44075 


44100 
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1050 


515 O02_region_G2_44 1 5 8_ 1 9_Re ver s e_Primer 


44252 


44227 


1051 


515O02_region_G2_44141_17_Forward_Primer 


44075 


44100 


1052 


515O02_region_G2_44141_17_Reverse_Primer 


44252 


44227 


1053 


515O02_region_G2_90762_17_Forward_Primer 


90637 


90663 


1054 


515O02_region_G2_90762_17_Reverse_Primer 


90814 


90790 


1055 


515O02_region_G2_106241_14_Forward_Primer 


106160 


106184 


1056 


5 15O02_region_G2_10624 l_14_Reverse_Primer 


106341 


106317 


1057 


5 1 5 O02_region_G2_ 1 09 67 6_ 1 2_For ward_Primer 


109609 


109634 


1058 


515O02_region_G2_109676_12_Reverse_Primer 


109793 


109768 


1059 


515O02_region_G2_86242_14_Forward_Primer 


86134 


86158 


1060 


515O02_region_G2_86242_14_Reverse_Primer 


86318 


86293 


1061 


5 15O02_region_G2_83 1 09_12_Forward_Primer 


83017 


83041 


1062 


5 15O02_region_G2_83 109_12_Reverse_Primer 


83202 


83178 


1063 


515002_region_G2_10461_15_Forward_Primer 


10418 


10442 


1064 


515O02_region_G2_10461_15_Reverse_Primer 


10609 


10585 


1065 


515O02_region_G2_67608_15_ForwardJPrimer 


67552 


67577 


1066 


515O02_region_G2_67608_15_Reverse_Primer 


67745 


67721 


1067 


515O02_region_G2_63275_46_Forward_Primer 


63148 


63173 


1068 


515O02_region_G2_63275_46_Reverse_Primer 


63347 


63323 


1069 


515O02_region_G2_62405_14_Forward_Primer 


62374 


62399 


1070 


515O02_region_G2_62405_14_Reverse_Primer 


62576 


62552 


1071 


515O02_region_G2_33563_12_Forward_Primer 


33460 


33484 


1072 


515 O02_region_G2__3 3 5 6 3_ 1 2_Re ver se_Primer 


33670 


33646 


1073 


5 15O02_regioiLG2__33 146_14JForward_Primer 


32949 


32973 


1074 


515O02_region_G2_33146_14_Reverse_Primer 


33191 


33167 


1075 


515O02_region_G2_102179_29_Forward_Primer 


102102 


102126 


1076 


515O02_region_G2_102179_29_Reverse_Primer 


102352 


102327 


1077 


5 1 5 O02_region_G2_2646_ 1 5_Forward_Primer 


2553 


2577 


1078 


515O02_region_G2_2646_15_Reverse_Primer 


2809 


2784 


1079 


5 1 5O02_region_G2_76652_24_Forward_Primer 


76567 


76591 


1080 


515O02_region_G2_76652_24_Reverse_Primer 


76835 


76812 


1081 


515O02_region_G2_66280_14_Forward_Primer 


66052 


66077 


1082 


515O02_region_G2_66280_14_Reverse_Primer 


66334 


66309 


1083 


515O02_region_G2_54768_13_Forward_Primer 


54640 


54666 


1084 


515002 region G2 54768 13 ReverseJPrimer 


54923 


54899 


1085 


515O02_region_G2_62580_14_Forward_Primer 


62552 


62576 


1086 


515O02_region_G2_62580_14_Reverse_Primer 


62840 


62816 


1087 


515O02_region_G2_34598_55_Forward_Primer 


34473 


34497 


1088 


515O02_region_G2_34598_55_Reverse_Primer 


34765 


34739 


1089 


515O02_region_G2_77680_13_Forward_Primer 


77444 


77470 


1090 


5 15O02_region_G2_77680_l 3_Reverse_Primer 


77741 


77716 


1091 


5 1 5O02_region_G2_77693_ 1 2_For ward_Primer 


77444 


77470 


1092 


515O02_region_G2_77693_12_Reverse_Primer 


77741 


77716 


1093 


515O02_region_G2_97392_14_Forward_Primer 


97255 


97280 


1094 


515O02_region_G2_97392_14_Reverse_Primer 


97554 


97530 


1095 


515O02_region_G2_97359_15_Forward_Primer 


97255 


97280 


1096 


515O02_region_G2_97359_15_Reverse_Primer 


97554 


97530 



Seq Num 


Seq ID 






1120 


consensusLRR 






1121 


rhglLRR 
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Seq Num 


Seq ID 






1122 


Rhg4LRR 







Seq Num 


Seq ID 


Primer location on 240O17_region_G3 




1123 


2400 1 7_region_G3_forward_ l_b 


45046-45072 





DETAILED DESCRIPTION OF THE INVENTION 

A) rhgl 

The present invention provides a method for the production of a soybean plant 
having an rhgl SCN resistant allele comprising: (A) crossing a first soybean plant having 
an rhgl SCN resistant allele with a second soybean plant having an rhgl SCN sensitive 
allele to produce a segregating population; (B) screening the segregating population for a 
member having an rhgl SCN resistant allele with a first nucleic acid molecule capable of 
specifically hybridizing to linkage group G, wherein the first nucleic acid molecule 
specifically hybridizes to a second nucleic acid molecule that is linked to the rhgl SCN 
resistant allele; and, (C) selecting the member for further crossing and selection. 

rhgl is located on linkage group G (Concibido et al, Crop Sci. 3(5:1643-1650 
(1996)). SCN resistant alleles of rhgl provide partial resistance to SCN races 1, 2, 3, 5, 6, 
and 14 (Concibido et al. {Crop Sci. 37:258-264 (1997)). Also, Webb (U.S. Patent 
5,491,081) reports that a QTL on linkage group G (rhgl) provides partial resistance to 
SCN races 1, 2, 3, 5, and 14. rhgl and RhgA provide complete or nearly complete 
resistance to SCN race 3 (U.S. Patent 5,491,081). While initially thought to be a 
recessive gene, rhgl classification as a recessive gene has been questioned. 

Using bioinformatic approaches, the rhgl coding region is predicted to contain 
either four exons (rhgl, v.i)(coding coordinates 45163-45314, 45450-45509, 46941- 
48763, and 48975-49573 of SEQ ID NO: 2) or two exons (rhgl, v.2) (coding coordinates 
46798-48763 and, 48975-49573 of SEQ ID NO: 3). rhgl, v.l encodes an 877 amino acid 
polypeptide, rhgl, v.2 encodes an 854 amino acid length polypeptide, rhgl codes for a 
Xa21-like receptor kinase (SEQ ID NOs: 1097, 1098, and 1100-1115) (Song, et al.,. 
Science 270, 1804-1806(1995)). rhgl has an extracellular leucine rich repeat (LRR) 
domain (rhgl, v.l, SEQ ID NO: 1097, residues 164-457; rhgl, v.l, SEQ ID NO: 1098, 
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residues 141-434), a transmembrane domain (rhgl, v.l, SEQ ID NO: 1097, residues 508- 
530; rhgl, v.l, SEQ ID NO: 1098, residues 33-51, and 485-507) and serine/threonine 
protein kinase (STK) domain {rhgl, v.l, SEQ ID NO: 1097, residues 578-869; rhgl, v.l, 
SEQ ID NO: 1098, residues 555-846). In a preferred embodiment, the LRR domain has 
multiple LRR repeats. In a more preferred embodiment, the LRR domain has 12 LRR 
repeats. 

To identify proteins similar to the proteins encoded by rhgl candidates, database 
searches are performed using the predicted peptide sequences. The rhgl candidate shows 
similarity to CAA18124, which is the Arabidopsis putative receptor kinase (58.4% simi- 
larity and 35.8% identity, (CLUSTALW (default parameters), Thompson et al, Nucleic 
Acids Res. 22:4673-4680 (1994)), GCG package, Genetics Computer Group, Madison, 
Wisconsin), and the apple leucine-rich receptor-like protein kinase (g3641252) (53.2% 
similarity and 31.5% identity, (CLUSTALW (default parameters))), which has both LLR 
and STK domains, showing conservation in both the LLR and STK domains. The pre- 
dicted LRR extracellular domain shows similarity to the tomato resistance genes Cf-l.l 
(Lycopersicon pimpinellifolium) (66.9% similarity and 45.4% identity (CLUSTALW 
(default parameters))) and C/-2.2 (Lycopersicon pimpinellifolium) (66.9% similarity and 
45.4% identity (CLUSTALW (default parameters))). 

Figure 1 is an alignment of the LRR domain of the rhgl gene. A consensus se- 
quence for the LRR is shown as the top row of the alignment. Each row of amino acids 
represents an LRR domain. The boxed region indicates the putative |3-turn/ |3-sheet 
structural motif postulated to be involved in ligand binding (Jones et al.,Adv. Bot. Res. 
Incorp. Adv. Plant Path. 24;89-167 (1997)). The hydrophobic leucine residues are 
thought to project into the core of the protein while the flanking amino acids are thought 
to be solvent exposed where they may interact with the ligand (Kobe et al, Nature 
374:183-186 (1995)). Non-conservative changes in this region are thought to affect fold- 
ing. An "x" represents an arbitrary amino acid while an "a" represents a hydrophobic res- 
idue (leucine, isoleucine, methionine, valine, or phenylalanine). Amino acid substitutions 
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between resistant and sensitive phenotypes are bordered by a double line. The amino acid 
substitution within the 302-325 region is a histidine/asparagine substitution, and the 
amino acid substitution within the 422-445 region is a phenylalanine/serine substitution. 

As used herein, a naturally occurring rhgl allele is any allele that encodes for a 
protein having an extracellular LRR, a transmembrane domain, and STK domain where 
the naturally occurring allele is present on linkage group G and where certain rhgl alleles, 
but not all rhgl alleles, are capable of providing or contributing to resistance or partial 
resistance to a race of SCN. It is understood that such an allele can, using for example 
methods disclosed herein, be manipulated so that the nucleic acid molecule encoding the 
protein is no longer present on linkage group G. It is also understood that such an allele 
can, using for example methods disclosed herein, be manipulated so that the nucleic acid 
molecule sequence is altered. 

As used herein, an rhgl SCN resistant allele is any rhgl allele where that allele 
alone or in combination with other SCN resistant alleles present in the plant, such as an 
Rhg4 SCN resistant allele, provides resistance to a race of SCN, and that resistance is due, 
at least in part, to the genetic contribution of the rhgl allele. 

SCN resistance or partial resistance is determined by a comparison of the plant in 
question with a known SCN sensitive host, Lee 74, according to the method set forth in 
Sclimitt, J. Nematol. 20:392-395 (1988). As used herein, resistance to a particular race of 
SCN is defined as having less than 10% of cyst development relative to the SCN sensitive 
host Lee 74. Moreover, as used herein, partial resistance to a particular race of SCN is 
defined as having more than 10% but less than 75% of cyst development relative to the 
SCN sensitive host Lee 74. 

Any soybean plant having an rhgl SCN resistant allele can be used in conjunc- 
tion with the present invention. Soybeans with known rhgl SCN resistant alleles can be 
used. Such soybeans include but are not limited to PI548402 (Peking), PI200499, A2869, 
Jack, A2069, PI209332 (No:4), PI404166 (Krasnoaarmejkaja), PI404198 (Sun huan do), 
PI437654 (Er-hej-jan), PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), 
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PI548988 (Pickett), PI84751, PI437654, PI40792, Pyramid, Nathan, AG2201, A3469, 
AG3901, A3904, AG4301, AG4401, AG4501, AG4601, PION9492, PI88788, Dyer, 
Custer, Manokin, and Doles. In a preferred aspect, the soybean plant having an rhgl 
SCN resistant allele is an rhgl haplotype 2 allele. Examples of soybeans with an rhgl 
haplotype 2 allele are PI548402 (Peking), PI404166 (Krasnoaarmejkaja), PI404198 (Sun 
huan do), PI437654 (Er-hej-jan), PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 
(Forrest), PI548988 (Pickett), PI84751, PI437654, and PI40792. In addition, using the 
methods or agents of the present invention, soybeans and wild relative of soybean such as 
Glycine soja can be screened for the presence of rhgl SCN resistant alleles. 

Any soybean plant having an rhgl SCN sensitive allele can be used in conjunc- 
tion with the present invention. Such soybeans include A3244, A2833, AG3001, Wil- 
liams, Will, A2704, Noir, DK23-51, Lee 74, Essex, Minsoy, A1923, and Hutcheson. In a 
preferred aspect, the soybean plant having an rhgl SCN sensitive allele is an rhgl A3244 
allele. In addition, using the methods or agents of the present invention, soybeans and 
wild relatives of soybean such as Glycine soja can be screened for the presence of rhgl 
SCN sensitive alleles. 

Table 2, below, is a table showing single nucleotide polymorphisms (SNPs) and 
insertions/deletions (1NDEL) sites for eight haplotype sequences of rhgl. 
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In Table 2, discrete haplotypes are designated 1 through 8. N/A refers to a 
haplotype that is not characterized. The Plant Introduction classification number is 
indicated in the "PI#" column. A dash indicates that no PI number is known or assigned 
for the line under investigation. The line from which the sequences are derived is 
indicated in the "line" column, with a dash indicating an unknown or unnamed line. The 
"Ph." (phenotype) column of table 2 indicates whether a given line has been reported as 
resistant (R) to at least one race of SCN or sensitive (S). 

The nucleotide bases located at each of 30 positions in each of the haplotype 
sequences is shown in the columns labeled "Base number of contig 240O17_region_G3 
of reference line A3244." The base number at the top of each column corresponds to the 
base number in contig 240O17_region_G3 of reference line A3224 (SEQ ID NOs: 2 and 
3). The letters G, A, C, and T correspond to the bases guanine, adenine, cytosine, and 
thymine. Two bases separated by a slash (A/G, C/T, or T/C) indicate uncertainty at the 
specified position of the haplotype sequence. A "d" followed by a number indicates a 
deletion of a the length specified. That is, dl is a one base deletion, d2 is a two base 
deletion, d 14 is a fourteen base deletion, and dl9 is a nineteen base deletion. A zero (0) 
indicates no deletion. A dash indicates that the identity of the base is undetermined. 

Examination of table 2 reveals that the amino acid substitutions in the rhgl cod- 
ing region are common to the resistant lines PI467312 (Cha-mo-shi-dou), PI88788 and 
the southern susceptible lines Essex, Hutchenson, Noir and A1923. As used herein, a 
"southern" cultivar is any cultivar from maturity groups VI, VII, VIII, IX, or X, and a 
"northern" cultivar is any cultivar from maturity groups 000, 00, 0, 1, II, III, IV, or V. 
This data is consistent with the mapping experiments of Qui et al. (TheorAppl Genet 
98:356-364 (1999)). Based on analysis of 200 F 2: 3 families derived from a cross between 
Peking and Essex, the authors failed to detect any significant association with SCN re- 
sistance to races 1, 2, and 3, and the rhgl locus on linkage group G. The authors point 
out that one of the markers, Bngl22, which has been shown to have significant linkage to 
rhgl (Concibido et al, Crop Sci. 36:1643-1650 (1996)), is not polymorphic in the 
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population employed. It is also possible that the susceptible southern lines contain rhgl 
and the susceptible phenotype reflects the polygenic nature of SCN resistance. In a study 
to uncover QTLs for sudden death syndrome (SDS) in soybean, two SCN resistant alleles 
originating from the susceptible parent Essex have been described (Hnetkovsky et al, 
Crop Sci. 36:393-400). 

Tables 3a, 3b, and 3c, below, show lines that share an rhgl haplotype. 



TABLE 3a 



Haplotype 2 Lines 


PI# 


Line 


Ph. 


PI548402 


Peking 


R 


PI404166 


Krasnoaarmej kaj a 


R 


PI404198 


(Sun huan do) 


R 


PI437654 


Er-hej-jan 


R 


PI438489 


(Chiquita) 


R 


PI507354 


Tokei 421 


R 


PI548655 


Forrest 


R 


PI548988 


Pickett 


R 


PI84751 




R 


PI437654 




R 


PI40792 






TABLE 3b 


Haplotype 4 Lines 


PI# 


Line 


Ph. 




Will 


S 


PI467312 


Cha-mo-shi-dou 


R 


PI88788 




R 


TABLE 3c 


Haplotype 6 Lines 


PI# 


Line 


Ph. 




Noir 


S 




A1923 


S 




Hutcheson 


S 



In Tables 3a, 3b, and 3c, Plant Introduction classification number is indicated in 
the "PI#" column. A dash indicates that no PI number is known or assigned for the line 
in question. The line from which the sequences are derived is indicated in the "line" 
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column, with a dash indicating an unknown or unnamed line. The "Ph." column indicates 
whether a given line has been reported as resistant (R) to at least one race of SCN or 
sensitive (S), with a dash indicating that the phenotype is unknown. 

In a preferred aspect, the source of either an rhgl SCN sensitive allele or an rhgl 
SCN resistant allele, or more preferably both, is an elite plant. An "elite line" is any line 
that has resulted from breeding and selection for superior agronomic performance. 
Examples of elite lines are lines that are commercially available to farmers or soybean 
breeders such as HARTZ™ variety H4994, HARTZ™ variety H5218, HARTZ™ variety 
H5350, HARTZ™ variety H5545, HARTZ™ variety H5050, HARTZ™ variety H5454, 
HARTZ™ variety H5233, HARTZ™ variety H5488, HARTZ™ variety HLA572, 
HARTZ™ variety H6200, HARTZ™ variety H6104, HARTZ™ variety H6255, 
HARTZ™ variety H6586, HARTZ™ variety H6191, HARTZ™ variety H7440, 
HARTZ™ variety H4452 Roundup Ready™, HARTZ™ variety H4994 Roundup 
Ready™, HARTZ™ variety H4988 Roundup Ready™, HARTZ™ variety H5000 
Roundup Ready™, HARTZ™ variety H5147 Roundup Ready™, HARTZ™ variety 
H5247 Roundup Ready™, HARTZ™ variety H5350 Roundup Ready™, HARTZ™ 
variety H5545 Roundup Ready™, HARTZ™ variety H5855 Roundup Ready™, 
HARTZ™ variety H5088 Roundup Ready™, HARTZ™ variety H5164 Roundup 
Ready™, HARTZ™ variety H5361 Roundup Ready™, HARTZ™ variety H5566 
Roundup Ready™, HARTZ™ variety H5181 Roundup Ready™, HARTZ™ variety 
H5889 Roundup Ready™, HARTZ™ variety H5999 Roundup Ready™, HARTZ™ 
variety H6013 Roundup Ready™, HARTZ™ variety H6255 Roundup Ready™, 
HARTZ™ variety H6454 Roundup Ready™, HARTZ™ variety H6686 Roundup 
Ready™, HARTZ™ variety H7152 Roundup Ready™, HARTZ™ variety H7550 
Roundup Ready™, HARTZ™ variety H8001 Roundup Ready™ (HARTZ SEED, 
Stuttgart, Arkansas, U.S.A.); A0868, AG0901, A1553, A1900, AG1901, A1923, A2069, 
AG2101, AG2201, A2247, AG2301, A2304, A2396, AG2401, AG2501, A2506, A2553, 
AG2701, AG2702, A2704, A2833, A2869, AG2901, AG2902, AG3001, AG3002, 
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A3204, A3237, A3244, AG3301, AG3302, A3404, A3469, AG3502, A3559, AG3601, 
AG3701, AG3704, AG3750, A3834, AG3901, A3904, A4045 AG4301, A4341, AG4401, 
AG4501, AG4601, AG4602, A4604, AG4702, AG4901, A4922, AG5401, A5547, 
AG5602, A5704, AG5801, AG5901, A5944, A5959, AG6101, QR4459 and QP4544 
(Asgrow Seeds, Des Moines, Iowa, U.S A.); DeKalb variety CX445 (DeKalb, Illinois). 
An elite plant is any plant from an elite line. 
B) RhgA 

The present invention provides a method for the production of a soybean plant 
having an RhgA SCN resistant allele comprising: (A) crossing a first soybean plant having 
an RhgA SCN resistant allele with a second soybean plant having an RhgA SCN sensitive 
allele to produce a segregating population; (B) screening the segregating population for a 
member having an RhgA SCN resistant allele with a first nucleic acid molecule capable of 
specifically hybridizing to linkage group A2, wherein the first nucleic acid molecule 
specifically hybridizes to a second nucleic acid molecule linked to the RhgA SCN resistant 
allele; and, (C) selecting the member for further crossing and selection. 

RhgA is located on linkage group A2 (Matson et al, Crop Sci. 5:447 (1965)). 
SCN resistant alleles of RhgA provide partial resistance to SCN races 1 and 3 (U.S. Patent 
5,491,081). Together, rhgl and RhgA provide complete or nearly complete resistance to 
SCN race 3. The dominant gene, RhgA, was found to be closely linked to the seed coat 
color locus (i ) (Matson et al., Crop Sci. 5:447 (1965)). The i locus in Peking was also 
reported to be linked with a recessive gene for resistance to SCN (Sugiyama et al, Jpn. J. 
Breed. 76:83-86 (1966)). It is possible that RhgA and the recessive gene linked to the i 
locus are one and the same, which would call into question the classification of Rhg4 as a 
dominant gene. 

Using bioinformatic approaches the RhgA coding region is predicted to contain 2 
exons (coding coordinates 111805-113968 and 114684-115204 of SEQ ID NO: 4). RhgA 
encodes an 894 amino acid polypeptide. RhgA codes for a Xa21-like receptor kinase 
(SEQ ID NOs: 1099 and 1116-1119) (Song et al, Science 270, 1804-1806, (1995)). 



48 



WO 01/51627 



PCT7US01/00552 



Rhg4 has an extracellular LRR domain (Rhg4, SEQ ID NO: 1099, residues 34-44), a 
transmembrane domain (Rhg4 SEQ ID NO: 1099, residues 449-471), and STK domain 
(Rhg4, SEQ ID NO: 1099, residues 531-830). In a preferred embodiment, the LRR 
domain has multiple LRR repeats. In a more preferred embodiment, the LRR domain has 
12 LRR repeats. 

To identify proteins similar to the Rhg4 candidate, database searches are 
performed using the predicted peptide sequences. The Rhg4 candidate shows similarity to 
TMK (Y07748)(73.0% similarity and 54.8% identity (CLUSTALW (default 
parameters))) and TMK1 PRECURSOR (70.6% similarity and 55.1% identity 
(CLUSTALW (default parameters))), which are rice and Arabidopsis receptor kinases, 
respectively. The predicted LRR extracellular domain reveals similarity to TMK 
(Y07748)(70.1% similarity and 46.6% identity (CLUSTALW (default parameters))), 
TMK1 PRECURSOR (gl707642) (65.8% similarity and 48.8% identity (CLUSTALW 
(default parameters))), and F21J9.1 (g2213607) (65.5% similarity and 45.6% identity 
(CLUSTALW (default parameters))). 

Figure 2 is an alignment of the LRR domain of the Rhg4 gene. A consensus 
sequence is shown as the top row. Each row of amino acids represents an LRR domain. 
The boxed region indicates the putative p-turn/ P-sheet structural motif postulated to be 
involved in ligand binding (Jones et at, Adv. Bot. Res. Incorp. Adv. Plant Path. 24;89- 
167 (1997)). The hydrophobic leucine residues are thought to project into the core of the 
protein while the flanking amino acids are thought to be solvent exposed where they may 
interact with the ligand (Kobe et al, Nature 374; 183-186 (1995)). An "x" represents an 
arbitrary amino acid while an "a" represents a hydrophobic residue (leucine, isoleucine, 
methionine, valine, or phenylalanine). Amino acid substitutions between resistant and 
sensitive phenotypes are bordered by a double line. The amino acid substitution within 
the 35-57 region is a histidine/glutamine substitution, and the amino acid substitution 
within the 81-104 region is a leucine/phenylalanine substitution. 
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As used herein, a naturally-occurring RhgA allele is any allele that encodes for a 
protein having an extracellular LRR domain, a transmembrane domain, and STK domain 
where the naturally occurring allele is present on linkage group A2 and where certain 
RghA alleles, but not all RghA alleles, are capable of providing or contributing to 
resistance or partial resistance to a race of SCN. It is understood that such an allele can, 
using, for example methods disclosed herein, be manipulated so that the nucleic acid 
molecule encoding the protein is no longer present on linkage group A2. It is also 
understood that such an allele can, using, for example methods disclosed herein, be 
manipulated so that the nucleic acid molecule sequence is altered. 

As used herein, an RhgA SCN resistant allele is any RhgA allele where that allele 
alone or in combination with other SCN resistant alleles present in the plant, such as an 
rhgl SCN resistant allele, provides resistance to a race of SCN, and that resistance is due, 
at least in part, to the genetic contribution of the RhgA allele. 

Any soybean plant having an RhgA SCN resistant allele can be used in 
conjunction with the present invention. Soybeans with known RhgA SCN resistant alleles 
can be used. Such soybeans include, but are not limited to, PI548402 (Peking), PI437654 
(Er-hej-jan), PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), PI548988 
(Pickett), PI88788, PI404198 (Sun Huan Do), PI404166 (Krasnoaarmejkaja), Hartwig, 
Manokin, Doles, Dyer, and Custer. In a preferred aspect, the soybean plant having an 
RhgA SCN resistant allele is an RhgA haplotype 3 allele in a plant having either an rhgl 
haplotype 2 or rhgl haplotype 4 allele. Examples of soybeans with an RhgA haplotype 3 
allele are PI548402 (Peking), PI88788, PI404198 (Sun huan do), PI438489 (Chiquita), 
PI437654 (Er-hej-jan), PI404166 (Krasnoaarmejkaja), PI548655 (Forrest), PI548988 
(Pickett), and PI507354 (Tokei 421). In addition, using the methods or agents of the 
present invention, soybeans and wild relatives of soybeans such as Glycine soja can be 
screened for the presence of RhgA SCN resistant alleles. 

Table 4 below is a table showing single nucleotide polymorphisms (SNPs) for 
three haplotype sequences of RhgA. 
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TABLE 4 



Identification 




Base number of contig 
318013 region A3 


Markers 


Hap 


PI number 


Line 


Ph 


Coat 


111933 


112065 


112101 


112461 


114066 


scn279 


scnb267 


scn273 






A2069 


R 


yellow 


T 


A 


T 


A 


T 


2 


2 


2 






A2869 


R 


yellow 


T 


A 


T 


A 


T 


2 


2 


2 






A3 244 


s 


yellow 


T 


A 


T 


A 


T 


2 


2 


2 




PI87631 




R 


yellow 


T 


A 


T 


A 


T 




2 


2 




PB48389 


Minsoy 


S 


yellow 


T 


A 


T 


A 


T 


2 


2 


2 


1 


PI5 18664 


Hutcheson 


s 


yellow 


T 


A 


T 


A 


T 


2 


2 


2 


1 


PI548658 


Lee 74 


s 


yellow 


T 


A 


T 


A 


T 




2 


2 


2 


PI540556 




R 


yellow 


G 


A 


T 


A 


T 


2 


2 




2 


PI360843 


Oshimashirome 


R 




G 


A 


T 


A 


T 








2 


PI423871 


Toyosuzu 


R 


yellow 


G 


A 


T 


A 


T 








3 


PI548402 




R 


black 


G 


C 


C 


T 


G 


j 


! 




3 


PI88788 




R 


black 


G 


C 


C 


T 


G 




j 


j 




PI404198 B 


(Sun huan do) 


R 




G 


c 


C 


T 


G 


1 


j 


j 




PI438489 B 


(Chiquita) 


R 


black 


G 


c 


C 


T 


G 




j 


j 


3 


PI437654 


Er-hej-jan 


R 


black 


G 


c 


C 


T 


G 


2 






3 


PI404166 


Krasnoaarniejkaja 


R 


black 


G 


c 


C 


T 


G 




! 




3 


PI290136 




S 


black 


G 


c 


C 


T 


G 


! 


! 


! 


3 


PI548655 




R 


yellow 


G 


c 


c 


T 


G 








3 


PI548988 


Pickett 


R 


yellow 


G 


c 


c 


T 


G 


1 






3 


PI507354 


Tokei 421 


R 


yellow 


G 


c 


c 


T 


G 


1 






N/A 


PI467312 


Cha-mo-shi-dou 


R 


GnBr 


G 


c 


c 


T 




1 






N/A 


PI209332 


No.4 


R 




T 


A 


T 






2 


2 


2 


N/A 


PI5 18672 


Will 


S 


yellow 


T 


A 


T 




T 


2 


2 


2 


N/A 


PI548667 


Essex 


S 




T 


A 


T 




T 


2 


2 


2 



In Table 4, discrete haplotypes are designated 1 through 3. N/A refers to a 
haplotype that is not characterized. In Table 4, the Plant Introduction classification 
number is indicated in the "PI#" column. A dash indicates that no PI number is known or 
assigned for the line under investigation. The line from which the sequences are derived 
is indicated in the "line" column, with a dash indicating an unknown or unnamed line. 
The "Ph." column of Table 4 indicates whether a given line has been reported to be 
resistant (R) to at least one race of SCN, or sensitive (S). The "coat" column shows the 
phenotypic coat color of a seed as either yellow, black, green/brown (GnBr), or 
unknown/unassigned (dash). At the / locus, black seeded varieties harbor the i allele for 
black or imperfect black seed coat. In a preferred embodiment, the seed has a yellow 
coat. 

The nucleotide base located at each of 5 positions in each of the haplotype 
sequences is shown in the columns labeled "Base number of contig 318013_region_A3." 
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The base number at the top of each column correspond to the base number in the contig 
318013_region_A3 of reference line A3244 (SEQ ID NO: 4). The letters G, A, C, and T 
correspond to the bases guanine, adenine, cytosine, and thymine. A dash indicates that 
the identity of the base is unknown. 

Three different simple sequence repeat (SSR) or microsatellite markers that occur 
within the sequences, scn279 (SEQ ID NO: 292), scn267 (SEQ ID NO: 282), and scn273 
(SEQ ID NO: 294), are listed under "markers." The allele of each marker occurring in a 
haplotype is indicated by a 1 or a 2, with a dash indicating that the information is not 
determined. 

Any soybean plant having an RhgA SCN sensitive allele can be used in 
conjunction with the present invention. Such soybeans include A3244, Will, Noir, Lee 
74, Essex, Minsoy, A2704, A2833, AG3001, Williams, DK23-51, and Hutcheson. In a 
preferred aspect, the soybean plant having an RhgA SCN sensitive allele is an RhgA 
A3244 allele, hi addition, using the methods or agents of the present invention, soybeans 
and wild relative of soybean such as Glycine soja can be screened for the presence of 
RhgA SCN sensitive alleles. 

In a preferred aspect, the source of either an RhgA SCN sensitive allele or an 
RhgA SCN resistant allele, or more preferably both, is an elite plant. 

In table 5, below, rhgl and RhgA haplotypes for various cultivars are compared. 



TABLE 5 



Identification 


Haplotype 


PI# 


Line 


Coat 


Ph. 


rhgA 


rhgl 




A3244 


yellow 


S 


1 


1 


PI548402 


Peking 


black 


R 


3 


2 


PI404198 B 


(Sun huan do) 


black 


R 


3 


2 


PI438489 B 


(Chiquita) 


black 


R 


3 


2 


PI437654 


Er-hej-jan 


black 


R 


3 


2 


PI404166 


Krasnoaarmejkaja 


black 


R 


3 


2 


PI548655 


Forrest 


yellow 


R 


3 


2 


PI548988 


Pickett 


yellow 


R 


3 


2 


PI507354 


Tokei 421 


yellow 


R 


3 


2 


PI88788 




black 


R 


3 


4 



52 



WO 01/51627 



PCT7US01/00552 



Identification 


Haplotype 


PI# 


Line 


Coat 


Ph. 


rhg4 


rhgl 


PI467312 


Cha-mo-shi-dou 


GnBr 


R 


N/A 


4 




Noir 


black 


S 


3 


6 




Jack 


yellow 


R 


2 


N/A 


PI360843 


Oshimashirome 


yellow 


R 


2 


N/A 


PI423871 


Toyosuzu 


yellow 


R 


2 


3 


PI209332 


No.4 


black 


R 


N/A 


N/A 


PI87631 


Kindaizu 


yellow 


R 


1 






Minsoy 


yellow 


S 


1 


N/A 




Will 


yellow 


S 


N/A 


4 




Hutcheson 


yellow 


s 


1 


6 




Lee 74 


yellow 


s 


N/A 


7 




Essex 


yellow 


s 


N/A 


N/A 




A2069 


yellow 


R 


1 


N/A 




A2869 


yellow 


R 


1 


N/A 



In Table 5, haplotypes, as used in Tables 2 through 4, are listed for each line. 
N/A refers to a haplotype that is not characterized. The Plant Introduction classification 
number is indicated in the "PI#" column. A dash indicates that no PI number is known or 
assigned for the line under investigation. The line from which the sequences are derived 
is indicated in the "line" column, with a dash indicating an unknown or unnamed line. 
The "Ph." column of table 5 indicates whether a given line has been reported to be 
resistant (R) to at least one race of SCN, or sensitive (S). The "coat" column shows the 
phenotypic coat color of a seed as either yellow, black, green/brown (GnBr), or 
unknown/unassigned (dash). At the / locus, black seeded varieties harbor the i allele for 
black or imperfect black seed coat. In a preferred embodiment, the seed has a yellow 
coat. 

Screening for rhgl and Rhg4 alleles 

Any appropriate method can be used to screen for a plant having an rhgl SCN 
resistant allele. Any appropriate method can be used to screen for a plant having an Rhg4 
SCN resistant allele. In a preferred aspect of the present invention, a nucleic acid marker 
of the present invention can be used (see section entitled "Screening for rhgl and RhgA 
alleles" and subsection (ii) of the section entitled "Agents"). 
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Additional markers, such as SSRs, AFLP markers, RELP markers, RAPD 
markers, phenotypic markers, SNPs, isozyme markers, microarray transcription profiles 
that are genetically linked to or correlated with alleles of a QTL of the present invention 
can be utilized (Walton, Seed World 22-29 (July, 1993); Burow et al, Molecular- 
Dissection of Complex Traits, 13-29, Eds. Paterson, CRC Press, New York (1988)). 
Methods to isolate such markers are known in the art. For example, locus-specific SSRs 
can be obtained by screening a genomic library for SSRs, sequencing of "positive" 
clones, designing primers which flank the repeats, and amplifying genomic DNA with 
these primers. The size of the resulting amplification products can vary by integral 
numbers of the basic repeat unit. To detect a polymorphism, PCR products can be 
radiolabeled, separated on denaturing polyacrylamide gels, and detected by 
autoradiography. Fragments with size differences >4 bp can also be resolved on agarose 
gels, thus avoiding radioactivity. 

Other SSR markers may be utilized. Amplification of simple tandem repeats, 
mainly of the [CA] n type were reported by Litt et al.,Amer. J. Human Genet. 44:397-401 
(1989); Smeets etal, Human Genet. 53:245-251 (1989); Tautz, Nucleic Acids Res. 
17:6463-6472 (1989); Weber etal, Am. J. Hum. Genet. 44:388-396 (1989). Weber, 
Genomics 7:524-530 (1990), reported that the level of polymorphism detected by PCR- 
amplified [CA] n type SSRs depends on the number of the "perfect" (?'.<?., uninterrupted), 
tandemly repeated motifs. Below a certain threshold (i.e., 12 CA-repeats), the SSRs were 
reported to be primarily monomorphic. Above this threshold, however, the probability of 
polymorphism increases with SSR length. Consequently, long, perfect arrays of SSRs are 
preferred for the generation of markers, i.e., for the design and synthesis of flanking 
primers. 

Suitable primers can be deduced from DNA databases (e.g., Akkaya et al., 
Genetics. 752:1131-1139 (1992)). Alternatively, size-selected genomic libraries (200 to 
500 bp) can be constructed by, for example, using the following steps: (1) isolation of 
genomic DNA; (2) digestion with one or more 4 base-specific restriction enzymes; (3) 
size-selection of restriction fragments by agarose gel electrophoresis, excision and 
purification of the desire size fraction; (4) ligation of the DNA into a suitable vector and 
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transformation into a suitable E. coli strain; (5) screening for the presence of SSRs by 
colony or plaque hybridization with a labeled probe; (6) isolation of positive clones and 
sequencing of the inserts; and (7) design of suitable primers flanking the SSR. 

Establishing libraries with small, size-selected inserts can be advantageous for 
SSR isolation for two reasons: (1) long SSRs are often unstable in E. coli, and (2) 
positive clones can be sequenced without subcloning. A number of approaches have been 
reported for the enrichment of SSRs in genomic libraries. Such enrichment procedures 
are particularly useful if libraries are screened with comparatively rare tri- and 
tetranucleotide repeat motifs. One such approach has been described by Ostrander et al, 
Proc. Natl. Acad. Sci. (U.S.A). 59:3419-3423 (1992), who reported the generation of a 
small-insert phagemid library in an E. coli strain deficient in UTPase (d8t) and uracil-N- 
glycosylase (ung) genes, hi the absence of UTPase and uracil-N-glycosylase, dUTP can 
compete with dTTP for the incorporation into DNA. Single-stranded phagemid DNA 
isolated from such a library can be primed with [CA] n and [TG] n primers for second 
strand synthesis, and the products used to transform a wild-type E. coli strain. Since 
under these conditions there will be selection against single-stranded, uracil-containing 
DNA molecules, the resulting library will consist of primer-extended, double-stranded 
products and an about 50-fold enrichment in CA-repeats. 

Other reported enrichment strategies rely on hybridization selection of simple 
sequence repeats prior to cloning (Karagyozov et al, Nucleic Acids Res. 21:39 11-3912 
(1993); Armour et al, Hum. Mol. Gen. 3:599-605 (1994); Kijas et al, Genome 35:349- 
355 (1994); Kandpal et al, Proc. Natl. Acad. Sci, (U.S.A.) 97:88-92 (1994); Edwards et 
al, Am. J. Hum. Genet. 49:746-756 (1991)). Hybridization selection, can for example, 
involve the following steps: (1) genomic DNA is fragmented, either by sonication, or by 
digestion with a restriction enzyme; (2) genomic DNA fragments are ligated to adapters 
that allow a "whole genome PCR" at this or a later stage of the procedure; (3) genomic 
DNA fragments are amplified, denatured and hybridized with single-stranded SSR 
sequences bound to a nylon membrane; (4) after washing off unbound DNA, hybridizing 
fragments enriched for SSRs are eluted from the membrane by boiling or alkali treatment, 
reamplified using adapter-complementary primers, and digested with a restriction enzyme 
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to remove the adapters; and (5) DNA fragments are ligated into a suitable vector and 
transformed into a suitable E. coli strain. SSRs can be found in up to 50-70% of the 
clones obtained from these procedures (Armour et al, Hum. Mol. Gen. 3:599-605 (1994); 
Edwards et al, Am. J. Hum. Genet. ¥9:746-756 (1991)). 

An alternative hybridization selection strategy was reported by Kijas et al, 
Genome 38:599-605 (1994), which replaced the nylon membrane with biotinylated, SSR- 
complementary oligonucleotides attached to streptavidin-coated magnetic particles. SSR- 
containing DNA fragments are selectively bound to the magnetic beads, reamplified, 
restriction-digested and cloned. 

It is further understood that other additional markers on linkage group G or A2 
may be utilized (Morgante et al, Genome 37:763-769 (1994)). As used herein, reference 
to the linkage group of G or A2 refers to the linkage group that corresponds to linkage 
groups U5 and U3, respectively from the genetic map of Glycine max (Mansur et al, 
Crop Set 36: 1327-1336 (1996), and linkage groups G and A2, respectively, of Glycine 
max x. Glycine soja (Shoemaker et al, Genetics 144: 329-336 (1996)) that is present in 
Glycine soja (Soybase, an Agricultural Research Service, United States Department of 
Agriculture (http://129.186.26.940/ and USDA - Agricultural Research Service: 
http://www.ars.usda.gov/)). 

PCR-amplified SSRs can be used, because they are locus-specific, codominant, 
occur in large numbers and allow the unambiguous identification of alleles. Standard 
PCR-amplified SSR protocols use radioisotopes and denaturing polyacrylamide gels to 
detect amplified SSRs. In many situations, however, allele sizes are sufficiently different 
to be resolved on high percentage agarose gels in combination with ethidium bromide 
staining (Bell et al, Genomics 79:137-144 (1994); Becker et al, Genome 38:991-998 
(1995); Huttel, Ph.D. Thesis, University of Frankfurt, Germany (1996)). High resolution 
without applying radioactivity is also provided by nondenaturing polyacrylamide gels in 
combination with either ethidium bromide (Scrimshaw, Biotechniques 73:2189 (1992)) or 
silver straining (Klinkicht et al, Molecular Ecology 1: 133-134 (1992); Neilan et al, 
Biotechniques 77:708-712 (1994)). An alternative of PCR-amplified SSRs typing 
involves the use of fluorescent primers in combination with a semi-automated DNA 
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sequencer (Schwengel et al, Genomics 22:46-54 (1994)). Fluorescent PCR products can 
be detected by real-time laser scanning during gel electrophoresis. An advantage of this 
technology is that different amplification reactions as well as a size marker (each labeled 
with a different fluorophore) can be combined into one lane during electrophoresis. 
Multiplex analysis of up to 24 different SSR loci per lane has been reported (Schwengel 
et al, Genomics 22:46-54 (1994)). 

The detection of polymorphic sites in a sample of DNA may be facilitated 
through the use of nucleic acid amplification methods. Such methods specifically 
increase the concentration of polynucleotides that span the polymorphic site, or include 
that site and sequences located either distal or proximal to it. Such amplified molecules 
can be readily detected by gel electrophoresis or other means. 

The most preferred method of achieving such amplification employs the 
polymerase chain reaction ("PCR") (Mullis et al, Cold Spring Harbor Symp. Quant. Biol. 
57:263-273 (1986); Erlich et al, European Patent Appln. 50,424; European Patent Appln. 
84,796, European Patent Application 258,017, European Patent Appln. 237,362; Mullis, 
European Patent Appln. 201,184; Mullis et al, U.S. Patent No. 4,683,202; Erlich, U.S. 
Patent No. 4,582,788; and Saiki et al, U.S. Patent No. 4,683,194), using primer pairs that 
are capable of hybridizing to the proximal sequences that define a polymorphism in its 
double-stranded form. 

In lieu of PCR, alternative methods, such as the "Ligase Chain Reaction" 
("LCR") may be used (Barany, Proc. Natl. Acad. Sci. (U.S.A.) 88:189-193 (1991)). LCR 
uses two pairs of oligonucleotide probes to exponentially amplify a specific target. The 
sequences of each pair of oligonucleotides is selected to permit the pair to hybridize to 
abutting sequences of the same strand of the target. Such hybridization forms a substrate 
for a template-dependent ligase. As with PCR, the resulting products thus serve as a 
template in subsequent cycles and an exponential amplification of the desired sequence is 
obtained. 

LCR can be performed with oligonucleotides having the proximal and distal 
sequences of the same strand of a polymorphic site. In one embodiment, either 
oligonucleotide will be designed to include the actual polymorphic site of the 
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polymorphism. In such an embodiment, the reaction conditions are selected such that the 
oligonucleotides can be ligated together only if the target molecule either contains or 
lacks the specific nucleotide that is complementary to the polymorphic site present on the 
oligonucleotide. Alternatively, the oligonucleotides may be selected such that they do not 
include the polymorphic site (see, Segev, PCT Application WO 90/01069). 

The "Oligonucleotide Ligation Assay" ("OLA") may alternatively be employed . 
(Landegren et al, Science 241:1077-1080 (1988)). The OLA protocol uses two 
oligonucleotides that are designed to be capable of hybridizing to abutting sequences of a 
single strand of a target. OLA, like LCR, is particularly suited for the detection of point 
mutations. Unlike LCR, however, OLA results in "linear" rather than exponential 
amplification of the target sequence. 

Nickerson et al. have described a nucleic acid detection assay that combines 
attributes of PCR and OLA (Nickerson et al, Proc. Natl. Acad. Sci. (U.S.A.) 57:8923- 
8927 (1990)). In this method, PCR is used to achieve the exponential amplification of 
target DNA, which is then detected using OLA. In addition to requiring multiple, and 
separate, processing steps, one problem associated with such combinations is that they 
inherit all of the problems associated with PCR and OLA. 

Schemes based on ligation of two (or more) oligonucleotides in the presence of a 
nucleic acid having the sequence of the resulting "di-oligonucleotide," thereby amplifying 
the di-oligonucleotide, are also known (Wu etal, Genomics 4:560-569 (1989)), and may 
be readily adapted to the purposes of the present invention. 

Other known nucleic acid amplification procedures, such as allele-specific 
oligomers, branched DNA technology, transcription-based amplification systems, or 
isothermal amplification methods may also be used to amplify and analyze such 
polymorphisms (Malek et al, U.S. Patent 5,130,238; Davey et al, European Patent 
Application 329,822; Schuster et al, U.S. Patent 5,169,766; Miller et al, PCT Patent 
Application WO 89/06700; Kwoh, et al., Proc. Natl Acad. Sci. (U.S.A.) 56:1173-1177 
(1989); Gingeras et al, PCT Patent Application WO 88/10315; Walker et al, Proc. Natl. 
Acad. Sci. (U.S.A.) 59:392-396 (1992)). 
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Polymorphisms can also be identified by Single Strand Conformation 
Polymorphism (SSCP) analysis. SSCP is a method capable of identifying most sequence 
variations in a single strand of DNA, typically between 150 and 250 nucleotides in length 
(Elles, Methods in Molecular Medicine: Molecular Diagnosis of Genetic Diseases, 
Humana Press (1996); Orita et al, Genomics 5: 874-879 (1989)). Under denaturing 
conditions a single strand of DNA will adopt a conformation that is uniquely dependent 
on its sequence conformation. This conformation usually will be different, even if only a 
single base is changed. Most conformations have been reported to alter the physical 
configuration or size sufficiently to be detectable by electrophoresis. A number of 
protocols have been described for SSCP including, but not hmited to, Lee et al, Anal. 
Biochem. 205: 289-293 (1992); Suzuki etal, Anal. Biochem. 192: 82-84 (1991); Lo et 
al, Nucleic Acids Research 20: 1005-1009 (1992); Sarkar etal, Genomics i3;441-443 

(1992) . It is understood that one or more of the nucleic acids of the present invention can 
be utilized as markers or probes to detect polymorphisms by SSCP analysis. 

Polymorphisms may also be found using random amplified polymorphic DNA 
(RAPD) (Williams et al, Nucl. Acids Res. 18: 6531-6535 (1990)) and cleaveable 
amplified polymorphic sequences (CAPS) (Lyamichev et al, Science 260: 778-783 

(1993) ). It is understood that one or more of the nucleic acid molecules of the present 
invention can be utilized as markers or probes to detect polymorphisms by RAPD or 
CAPS analysis. 

The identification of a polymorphism can be determined in a variety of ways. By 
correlating the presence or absence of it in a plant with the presence or absence of a 
phenotype, it is possible to predict the phenotype of that plant. If a polymorphism creates 
or destroys a restriction endonuclease cleavage site, or if it results in the loss or insertion 
of DNA (e.g., a variable nucleotide tandem repeat (VNTR) polymorphism), it will alter 
the size or profile of the DNA fragments that are generated by digestion with that 
restriction endonuclease. As such, individuals that possess a variant sequence can be 
distinguished from those having the original sequence by restriction fragment analysis. 
Polymorphisms that can be identified in this manner are termed "restriction fragment 
length polymorphisms" ("RFLPs"). RFLPs have been widely used hi human and plant 
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genetic analyses (Glassberg, UK Patent Application 2135774; Skolnick et al, Cytogen. 
Cell Genet. 52:58-67 (1982); Botstein etal.,Ann. J. Hum. Genet. 32:314-331 (1980); 
Fischer et al. (PCT Application WO90/13668); Uhlen, PCT Application WO90/11369)). 

A central attribute of "single nucleotide polymorphisms," or "SNPs" is that the 
site of the polymorphism is at a single nucleotide. SNPs have certain reported advantages 
over RFLPs and VNTRs. First, SNPs are more stable than other classes of 
polymorphisms. Their spontaneous mutation rate is approximately 10" 9 (Kornberg, DNA 
Replication, W.H. Freeman & Co., San Francisco, 1980), approximately 1,000 times less 
frequent than VNTRs (U.S. Patent 5,679,524). Second, SNPs occur at greater frequency, 
and with greater uniformity than RFLPs and VNTRs. As SNPs result from sequence 
variation, new polymorphisms can be identified by sequencing random genomic or cDNA 
molecules. SNPs can also result from deletions, point mutations and insertions. Any 
single base alteration, whatever the cause, can be an SNP. The greater frequency of SNPs 
means that they can be more readily identified than the other classes of polymorphisms. 

SNPs and insertion/deletions can be detected by methods, by any of a variety of 
methods including those disclosed in U.S. Patents 5,210,015; 5,876,930 and 6,030,787 in 
which an oligonucleotide probe having reporter and quencher molecules is hybridized to a 
target polynucleotide. The probe is degraded by 5' 3' exonuclease activity of a nucleic 
acid polymerase. A useful assay is available from AB Biosystems (850 Lincoln Centre 
Drive, Foster City, CA) as the Taqman® assay. 

Specific nucleotide variations such as SNPs and insertion/deletions can also be 
detected by labeled base extension methods as disclosed in U.S. Patents 6,004,744; 
6,013,431; 5,595,890; 5,762,876; and 5,945,283. These methods are based on primer 
extension and incorporation of detectable nucleoside triphosphates. The primer is 
designed to anneal to the sequence immediately adjacent to the variable nucleotide which 
can be can be detected after incorporation of as few as one labeled nucleoside 
triphosphate. US Patent 5,468,613 discloses allele specific oligonucleotide hybridizations 
where single or multiple nucleotide variations in nucleic acid sequence can be detected in 
nucleic acids by a process in which the sequence containing the nucleotide variation is 
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amplified, spotted on a membrane and treated with a labeled sequence-specific 
oligonucleotide probe. 

Such methods also include the direct or indirect sequencing of the site, the use of 
restriction enzymes where the respective alleles of the site create or destroy a restriction 
site, the use of allele-specific hybridization probes, the use of antibodies that are specific 
for the proteins encoded by the different alleles of the polymorphism or by other 
biochemical interpretation. SNPs can be sequenced by a number of methods. Two basic 
methods may be used for DNA sequencing, the chain termination method of Sanger et al, 
Proc. Natl. Acad. Sci. (U.S.A.) 74: 5463-5467 (1977), and the chemical degradation 
method of Maxam et al, Proc. Nat. Acad. Sci. (U.S.A.) 74: 560-564 (1977). Automation 
and advances in technology such as the replacement of radioisotopes with fluorescence- 
based sequencing have reduced the effort required to sequence DNA (Craxton, Methods, 
2: 20-26 (1991); Ju et al, Proc. Natl Acad. Sci. (U.S.A.) 92: 4347-4351 (1995); Tabor et 
al, Proc. Natl. Acad. Sci. (U.S.A.) 92: 6339-6343 (1995)). Automated sequencers are 
available from, for example, Pharmacia Biotech, Inc., Piscataway, New Jersey 
(Pharmacia ALF), LI-COR, Inc., Lincoln, Nebraska (LI-COR 4,000) and Millipore, 
Bedford, Massachusetts (Millipore BaseStation). 

In addition, advances in capillary gel electrophoresis have also reduced the effort 
required to sequence DNA and such advances provide a rapid high resolution approach 
for sequencing DNA samples (Swerdlow et al. , Nucleic Acids Res. 75:1415-1419 (1990); 
Smith, Nature 349:8 12-813 (1991); Luckey et al. , Methods Enzymol. 218: 154-172 
(1993); Lu et al, J. Chromatog. A. 680:491-501 (1994); Carson et al, Anal. Chem. 
65:3219-3226 (1993); Huang et al, Anal Chem. 64:2149-2154 (1992); Kheterpal et al, 
Electrophoresis 77:1852-1859 (1996); Quesada etal, Electrophoresis 77:1841-1851 
(1996); Baba, Yakugaku Zasshi 777:265-281 (1997), Marino, Appl. Theor. Electrophor. 
5:1-5 (1995)). 

The genetic linkage of marker molecules can be established by a gene mapping 
model such as, without limitation, the flanking marker model reported by Lander et al, 
Genetics, 727:185-199 (1989), and the interval mapping, based on maximum likelihood 
methods described by Lander et al, Genetics, 727:185-199 (1989), and implemented in 



61 



WO 01/51627 



PCT7US01/00552 



the software package MAPMAKER/QTL (Lincoln et al, Mapping Genes Controlling 
Quantitative Traits Using MAPMAKER/QTL, Whitehead Institute for Biomedical 
Research, Massachusetts, (1990). Additional software includes Qgene, Version 2.23 
(1996), Department of Plant Breeding and Biometry, 266 Emerson Hall, Cornell 
University, Ithaca, NY). Use of Qgene software is a particularly preferred approach. 

A maximum likelihood estimate (MLE) for the presence of a marker is 
calculated, together with an MLE assuming no QTL effect, to avoid false positives. A 
log 10 of an odds ratio (LOD) is then calculated as: LOD = logio (MLE for the presence of 
a QTL/MLE given no linked QTL). 

The LOD score essentially indicates how much more likely the data are to have 
arisen assuming the presence of a QTL than in its absence. The LOD threshold value for 
avoiding a false positive with a given confidence, say 95%, depends on the number of 
markers and the length of the genome. Graphs indicating LOD thresholds are set forth in 
Lander et al, Genetics, 727:185-199 (1989), and further described by Arus et al, Plant 
Breeding, Hayward et al. (eds.) Chapman & Hall, London, pp. 314-331 (1993). 

Additional models can be used. Many modifications and alternative approaches 
to interval mapping have been reported, including the use of non-parametric methods 
(Kruglyak et al, Genetics, 7 39:1421-1428 (1995)). Multiple regression methods or 
models can be also be used, in which the trait is regressed on a large number of markers 
(Jansen, Biometrics in Plant Breed, van Oijen, Jansen (eds.) Proceedings of the Ninth 
Meeting of the Eucarpia Section Biometrics in Plant Breeding, The Netherlands, pp. 116- 
124 (1994); Weber et al, Advances in Plant Breeding, Blackwell, Berlin, 16 (1994)). 
Procedures combining interval mapping with regression analysis, whereby the phenotype 
is regressed onto a single putative QTL at a given marker interval, and at the same time 
onto a number of markers that serve as 'cofactors,' have been reported by Jansen et al, 
Genetics, 736:1447-1455 (1994) and Zeng, Genetics, 736:1457-1468 (1994). Generally, 
the use of cofactors reduces the bias and sampling error of the estimated QTL positions 
(Utz et al, Biometrics in Plant Breeding, van Oijen et al. (eds.) Proceedings of the Ninth 
Meeting of the Eucarpia Section Biometrics in Plant Breeding, The Netherlands, pp. 195- 
204 (1994), thereby improving the precision and efficiency of QTL mapping (Zeng, 
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Genetics, 136: 1457-1468 (1994)). These models can be extended to multi-environment 
experiments to analyze genotype-environment interactions (Jansen et al, Theo. Appl. 
Genet. 91:33-31 (1995)). 

Selection of an appropriate mapping or segregation populations is important to 
map construction. The choice of appropriate mapping population depends on the type of 
marker systems employed (Tanksley et al, Molecular mapping plant chromosomes. 
Chromosome structure and function: Impact of new concepts J.P. Gustafson et al. (eds.), 
Plenum Press, New York, pp. 157-173 (1988)). Consideration must be given to the 
source of parents (adapted vs. exotic) used in the mapping population. Chromosome 
pairing and recombination rates can be severely disturbed (suppressed) in wide crosses 
(adapted x exotic) and generally yield greatly reduced linkage distances. Wide crosses 
will usually provide segregating populations with a relatively large array of 
polymorphisms when compared to progeny in a narrow cross (adapted x adapted). 

As used herein, the progeny include not only, without limitation, the products of 
any cross (be it a backcross or otherwise) between two plants, but all progeny whose 
pedigree traces back to the original cross. Specifically, without limitation, such progeny 
include plants that have 12.5% or less genetic material derived from one of the two 
originally crossed plants. As used herein, a second plant is derived from a first plant if 
the second plant's pedigree includes the first plant. 

An F 2 population is the first generation of selfmg after the hybrid seed is 
produced. Usually a single Fi plant is selfed to generate a population segregating for all 
the genes in Mendelian (1:2: 1) fashion. Maximum genetic information is obtained from a 
completely classified F 2 population using a codominant marker system (Mather, 
Measurement of Linkage in Heredity: Methuen & Co., (1938)). In the case of dominant 
markers, progeny tests (e.g., F 3 , BCF 2 ) are required to identify the heterozygotes, thus 
making it equivalent to a completely classified F 2 population. However, this procedure is 
often prohibitive because of the cost and time involved in progeny testing. Progeny 
testing of F 2 individuals is often used in map construction where phenotypes do not 
consistently reflect genotype (e.g., disease resistance) or where trait expression is 
controlled by a QTL. Segregation data from progeny test populations (e.g., F 3 or BCF 2 ) 
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can be used in map construction. Marker-assisted selection can then be applied to cross 
progeny based on marker-trait map associations (F 2 , F 3 ), where linkage groups have not 
been completely disassociated by recombination events (i.e., maximum disequilibrium). 

Recombinant inbred lines (RIL) (genetically related lines; usually >F 5 , developed 
from continuously selfing F 2 lines towards homozygosity) can be used as a mapping 
population. Information obtained from dominant markers can be maximized by using 
RIL because all loci are homozygous or nearly so. Under conditions of tight linkage (i.e. , 
about <10% recombination), dominant and co-dominant markers evaluated in RIL 
populations provide more information per individual than either marker type in backcross 
populations (Reiter et al, Proc. Natl. Acad. Sci. (U.S.A.) 59:1477-1481 (1992)). 
However, as the distance between markers becomes larger (i.e., loci become more 
independent), the information in RIL populations decreases dramatically when compared 
to codominant markers. 

Backcross populations (e.g., generated from a cross between a successful variety 
(recurrent parent) and another variety (donor parent) carrying a trait not present in the 
former) can be utilized as a mapping population. A series of backcrosses to the recurrent 
parent can be made to recover most of its desirable traits. Thus a population is created 
consisting of individuals nearly like the recurrent parent but each individual carries 
varying amounts or mosaic of genomic regions from the donor parent. Backcross 
populations can be useful for mapping dominant markers if all loci in the recurrent parent 
are homozygous and the donor and recurrent parent have contrasting polymorphic marker 
alleles (Reiter et al, Proc. Natl. Acad. Sci. (U.S.A.) 89: 1477-1481 (1992)). Information 
obtained from backcross populations using either codominant or dominant markers is less 
than that obtained from F 2 populations because one, rather than two, recombinant gametes 
are sampled per plant. Backcross populations, however, are more informative (at low 
marker saturation) when compared to RILs as the distance between linked loci increases 
in RIL populations (i.e., about . 15% recombination). Increased recombination can be 
beneficial for resolution of tight linkages, but may be undesirable in the construction of 
maps with low marker saturation. 
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Near-isogenic lines (ML) created by many backcrosses to produce an array of 
individuals that are nearly identical in genetic composition except for the trait or genomic 
region under interrogation can be used as a mapping population. In mapping with NILs, 
only a portion of the polymorphic loci are expected to map to a selected region. 

Bulk segregant analysis (BSA) is a method developed for the rapid identification 
of linkage between markers and traits of interest (Michelmore, et al, Proc. Natl. Acad. 
Sci. (U.S.A.) 88:9828-9832 (1991)). In BSA, two bulked DNA samples are drawn from a 
segregating population originating from a single cross. These bulks contain individuals 
that are identical for a particular trait (resistant or sensitive to particular disease) or 
genomic region but arbitrary at unlinked regions (i.e., heterozygous). Regions unlinked 
to the target region will not differ between the bulked samples of many individuals in 
BSA. 

Plants generated using a method of the present invention can be part of or 
generated from a breeding program. The choice of breeding method depends on the mode 
of plant reproduction, the heritability of the trait(s) being improved, and the type of 
cultivar used commercially (e.g., Fi hybrid cultivar, pureline cultivar, etc). Selected, non- 
limiting approaches, for breeding the plants of the present invention are set forth below. 
A breeding program can be enhanced using marker assisted selection of the progeny of 
any cross. It is further understood that any commercial and non-commercial cultivars can 
be utilized in a breeding program. Factors such as, for example, emergence vigor, 
vegetative vigor, stress tolerance, disease resistance, branching, flowering, seed set, seed 
size, seed density, standability, and threshability etc. will generally dictate the choice. 

For highly heritable traits, a choice of superior individual plants evaluated at a 
single location will be effective, whereas for traits with low heritability, selection should 
be based on mean values obtained from replicated evaluations of families of related 
plants. Popular selection methods commonly include pedigree selection, modified 
pedigree selection, mass selection, and recurrent selection. In a preferred embodiment a 
backcross or recurrent breeding program is undertaken. 

The complexity of inheritance influences choice of the breeding method. 
Backcross breeding can be used to transfer one or a few favorable genes for a highly 
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heritable trait into a desirable cultivar. This approach has been used extensively for 
breeding disease-resistant cultivars. Various recurrent selection techniques are used to 
improve quantitatively inherited traits controlled by numerous genes. The use of 
recurrent selection in self-pollinating crops depends on the ease of pollination, the 
frequency of successful hybrids from each pollination, and the number of hybrid offspring 
from each successful cross. 

Breeding lines can be tested and compared to appropriate standards in 
environments representative of the commercial target area(s) for two or more generations. 
The best lines are candidates for new commercial cultivars; those still deficient in traits 
may be used as parents to produce new populations for further selection. 

One method of identifying a superior plant is to observe its performance relative 
to other experimental plants and to a widely grown standard cultivar. If a single 
observation is inconclusive, replicated observations can provide a better estimate of its 
genetic worth. A breeder can select and cross two or more parental lines, followed by 
repeated selling and selection, producing many new genetic combinations. 

The development of new soybean cultivars requires the development and 
selection of soybean varieties, the crossing of these varieties and selection of superior 
hybrid crosses. The hybrid seed can be produced by manual crosses between selected 
male-fertile parents or by using male sterility systems. Hybrids are selected for certain 
single gene traits such as pod color, flower color, seed yield, pubescence color or 
herbicide resistance which indicate that the seed is truly a hybrid. Additional data on 
parental lines, as well as the phenotype of the hybrid, influence the breeder's decision 
whether to continue with the specific hybrid cross. 

Pedigree breeding and recurrent selection breeding methods can be used to 
develop cultivars from breeding populations. Breeding programs combine desirable traits 
from two or more cultivars or various broad-based sources into breeding pools from 
which cultivars are developed by selfing and selection of desired phenotypes. New 
cultivars can be evaluated to determine which have commercial potential. 

Pedigree breeding is used commonly for the improvement of self-pollinating 
crops. Two parents who possess favorable, complementary traits are crossed to produce 



66 



WO 01/51627 



PCT7US01/00552 



an Fi. An F 2 population is produced by selfrng one or several Fi's. Selection of the best 
individuals in the best families is performed. Replicated testing of families can begin in 
the F 4 generation to improve the effectiveness of selection for traits with low heritability. 
At an advanced stage of inbreeding (i.e., F 6 and F 7 ), the best lines or mixtures of 
phenotypically similar lines are tested for potential release as new cultivars. 

Backcross breeding has been used to transfer genes for a simply inherited, highly 
heritable trait into a desirable homozygous cultivar or inbred line, which is the recurrent 
parent. The source of the trait to be transferred is called the donor parent. The resulting 
plant is expected to have the attributes of the recurrent parent (e.g., cultivar) and the 
desirable trait transferred from the donor parent. After the initial cross, individuals 
possessing the phenotype of the donor parent are selected and repeatedly crossed 
(backcrossed) to the recurrent parent. The resulting parent is expected to have the 
attributes of the recurrent parent (e.g., cultivar) and the desirable trait transferred from the 
donor parent. 

The single-seed descent procedure in the strict sense refers to planting a 
segregating population, harvesting a sample of one seed per plant, and using the one-seed 
sample to plant the next generation. When the population has been advanced from the F 2 
to the desired level of inbreeding, the plants from which lines are derived will each trace 
to different F 2 individuals. The number of plants in a population declines each generation 
due to failure of some seeds to germinate or some plants to produce at least one seed. As 
a result, not all of the F 2 plants originally sampled in the population will be represented by 
a progeny when generation advance is completed. 

In a multiple-seed procedure, soybean breeders commonly harvest one or more 
pods from each plant in a population and thresh them together to form a bulk. Part of the 
bulk is used to plant the next generation and part is put in reserve. The procedure has 
been referred to as modified single-seed descent or the pod-bulk technique. 

The multiple-seed procedure has been used to save labor at harvest. It is 
considerably faster to thresh pods with a machine than to remove one seed from each by 
hand for the single-seed procedure. The multiple-seed procedure also makes it possible to 
plant the same number of seeds of a population each generation of inbreeding. 
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Descriptions of other breeding methods that are commonly used for different 
traits and crops can be found in one of several reference books (e.g., Fehr, Principles of 
Cultivar Development Vol. 1, pp. 2-3 (1987)). 

In a preferred aspect of the present invention the source of the rhgl SCN resistant 
allele for use in a breeding program is derived from a plant selected from the group 
consisting of PI548402 (Peking), PI200499, A2869, Jack, A2069, PI209332 (No:4), 
PI404166 (Krasnoaarmejkaja), PI404198 (Sun huan do), PI437654 (Er-hej-jan), 
PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), PI548988 (Pickett), 
PI84751, PI437654, PI40792, Pyramid, Nathan, AG2201, A3469, AG3901, A3904, 
AG4301, AG4401, AG4501, AG4601, PION9492, PI88788, Dyer, Custer, Manokin, 
Doles, and SCN resistant progeny thereof (USD A, Soybean Germplasm Collection, 
University of Illinois, Illinois). In a more preferred aspect, the source of the rhgl SCN 
resistant allele for use in a breeding program is derived from a plant selected from the 
group consisting of PI548402 (Peking), PI404166 (Krasnoaarmejkaja), PI404198 (Sun 
huan do), PI437654 (Er-hej-jan), PI438489 (Chiquita), PK07354 (Tokei 421), PI548655 
(Forrest), PI548988 (Pickett), PI84751, PI437654, PI40792, and SCN resistant progeny 
thereof. 

In a preferred aspect of the present invention the source of the rhgl SCN 
sensitive allele for use in a breeding program is derived from a plant selected from the 
group consisting of A3244, A2833, AG3001, Williams, Will, A2704, Noir, DK23-51, 
Lee 74, Essex, Minsoy, A1923, Hutcheson, and SCN sensitive progeny thereof. In a 
more preferred aspect, the source of the rhgl SCN sensitive allele for use in a breeding 
program is derived from an A3244 plant, and SCN sensitive progeny thereof. 

In a preferred aspect of the present invention the source of the Rhg4 SCN 
resistant allele for use in a breeding program is derived from a plant selected from the 
group consisting of PI548402 (Peking), PI437654 (Er-hej-jan), PI438489 (Chiquita), 
PI507354 (Tokei 421), PK48655 (Forrest), PI548988 (Pickett), PI88788, PI404198 (Sun 
Huan Do), PI404166 (Krasnoaarmejkaja), Hartwig, Manokin, Doles, Dyer, Custer, and 
SCN resistant progeny thereof. In a more preferred aspect, the source of the RhgA SCN 
resistant allele for use in a breeding program is derived from a plant selected from the 
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group consisting of PI548402 (Peking), PI88788, PI404198 (Sun huan do), PI438489 
(Chiquita), PI437654 (Er-hej-jan), PI404166 (Krasnoaarmejkaja), PI548655 (Forrest), 
PI548988 (Pickett), PI507354 (Tokei 421), and SCN resistant progeny thereof. 

In a preferred aspect of the present invention the source of the RhgA SCN 
sensitive allele for use in a breeding program is derived from a plant selected from the 
group consisting of A3244, Will, Noir, Lee 74, Essex, Minsoy, A2704, A2833, AG3001, 
Williams, DK23-51, and Hutcheson, and SCN sensitive progeny thereof. In a more 
preferred aspect, the source of the RhgA SCN sensitive allele for use in a breeding 
program is derived from an A3244 plant, and SCN sensitive progeny thereof. 

As used herein linkage of a nucleic acid sequence with another nucleic acid 
sequence may be genetic or physical. In a preferred embodiment, a nucleic acid marker is 
genetically linked to either rhgl or RhgA, where the marker nucleic acid molecule 
exhibits a LOD score of greater than 2.0, as judged by interval mapping, for SCN 
resistance or partial resistance, preferably where the marker nucleic acid molecule 
exhibits a LOD score of greater than 3.0, as judged by interval mapping, for SCN 
resistance or partial resistance, more preferably where the marker nucleic acid molecule 
exhibits a LOD score of greater than 3.5, as judged by interval mapping, for SCN 
resistance or partial resistance and even more preferably where the marker nucleic acid 
molecule exhibits a LOD score of about 4.0, as judged by interval mapping, for SCN 
resistance or partial resistance based on maximum likelihood methods described by 
Lander et al, Genetics, 727:185-199 (1989), and implemented in the software package 
MAPMAKER/QTL (default parameters)(Lincoln et al, Mapping Genes Controlling 
Quantitative Traits Using MAPMAKER/QTL, Whitehead Institute for Biomedical 
Research, Massachusetts, (1990)). 

hi another embodiment the nucleic acid molecule may be physically linked to 
either rhgl or RhgA. hi a preferred embodiment, the nucleic acid marker specifically 
hybridizes to a nucleic acid molecule having a sequence that is present on linkage group 
G within 500 kb or lOOkb, more preferably within 50kb, even more preferably within 
25kb of an rhgl allele, where the rghl allele is preferably a sensitive allele, and more 
preferably a sensitive allele from A3244. Li a preferred embodiment the nucleic acid 
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marker is capable of specifically hybridizing to a nucleic acid molecule having a sequence 
that is present on linkage group A2 within 500kb or lOOkb, more preferably within 50kb, 
even more preferably within 25kb of an RhgA allele, where the RghA allele is preferably a 
sensitive allele, and more preferably a sensitive allele from A3244. 

The present invention provides a method of investigating an rhgl haplotype of a 
soybean plant comprising: (A) isolating nucleic acid molecules from the soybean plant; 
(B) determining the nucleic acid sequence of an rhgl allele or part thereof; and, (C) 
comparing the nucleic acid sequence of the rhgl allele or part thereof to a reference 
nucleic acid sequence. 

As used herein, the term "investigating" refers to any method capable of detecting 
a feature, such as a polymorphism or haplotype. Nucleic acid molecules only need to be 
isolated from a soybean plant to the degree of purity necessary for the task required or to 
a greater purity if desired. A person of ordinary skill in the art has available techniques to 
isolate nucleic acid molecules from plants to a sufficient purity, for example without 
limitation, to sequence the desired region of the nucleic acid molecule or to carry out a 
marker assay. 

The determination of an rhglot RhgA allele or part thereof may be carried out 
using any technique. Illustration of such techniques include techniques that provide the 
nucleic acid sequence for an rhgl or rhgA allele or part thereof include amplification of a 
desired allele or part thereof (see, for example, the Examples and SEQ ID NOs: 8-53). In 
a preferred embodiment, the nucleic acid sequence determined is that of an exon of an 
rhgl allele, more preferably exon 1 or exon 3 of an rhgl allele, or of an LRR domain. In 
another preferred embodiment, a single nucleotide is determined. In another preferred 
embodiment, the nucleic acid sequence determined is that of an LRR domain. 

A comparison of a sequence with a reference sequence can be carried out with 
any appropriate sequence comparison method. 

As used herein, a reference sequence is any rhgl allele sequence or consensus 
sequence. A reference sequence may be a nucleic acid sequence or an amino acid 
sequence. In a preferred embodiment, the reference sequence is any SCN resistant rhgl 
allele sequence. In a further preferred embodiment, the rhgl reference sequence is 
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selected from the group consisting of SEQ ID NOs: 2, 3, 5, 6, 8-23, 28-43, 1097, 1098, 
and 1100-1115. 

The present invention provides a method of investigating an RhgA haplotype of a 
soybean plant comprising: (A) isolating nucleic acid molecules from the soybean plant; 
(B) determining the nucleic acid sequence of an RhgA allele or part thereof; and (C) 
comparing the nucleic acid sequence of the RhgA allele or part thereof to a reference 
nucleic acid sequence. 

As used herein, a reference sequence is any RhgA allele sequence or consensus 
sequence. A reference sequence ma be a nucleic acid sequence or an amino acid 
sequence. In a preferred embodiment, the reference sequence is any SCN resistant RhgA 
allele sequence. In a further preferred embodiment, the RhgA reference sequence is 
selected from the group consisting of SEQ ID NOs: 4, 7, 44-47, 50-53, 1099, and 1 1 16- 
1119. 

The present invention provides a method of introgressing SCN resistance or 
partial SCN resistance into a soybean plant comprising: performing marker assisted 
selection of the soybean plant with a nucleic acid marker, wherein the nucleic acid marker 
specifically hybridizes with a nucleic acid molecule having a first nucleic acid sequence 
that is physically linked to a second nucleic acid sequence that is located on linkage group 
G of soybean A3244, wherein the second nucleic acid sequence is within 500 kb of a third 
nucleic acid sequence which is capable of specifically hybridizing with the nucleic acid 
sequence of SEQ ID NO: 5, 6, complements thereof, or fragments thereof; and, selecting 
the soybean plant based on the marker assisted selection. 

The present invention provides a method of introgressing SCN resistance or 
partial SCN resistance into a soybean plant comprising: performing marker assisted 
selection of the soybean plant with a nucleic acid marker, wherein the nucleic acid marker 
specifically hybridizes with a nucleic acid molecule having a first nucleic acid sequence 
that is physically linked to a second nucleic acid sequence that is located on linkage group 
A2 of soybean A3244, wherein the second nucleic acid sequence is within 500 kb of a 
third nucleic acid sequence which is capable of specifically hybridizing with the nucleic 
acid sequence of SEQ ID NO: 7, complements thereor, or fragments thereof; and, 
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selecting the soybean plant based on the marker assisted selection. Marker assisted 
introgression of traits into plants has been reported. Marker assisted introgression 
involves the transfer of a chromosome region defined by one or more markers from one 
germplasm to a second germplasm. In a preferred embodiment the introgression is 
carried out by backcrossing with an rhgl or RhgA SCN resistant soybean recurrent parent. 

In light of the current disclosure, plant introductions and germplasm can be 
screened with a marker nucleic acid molecule of the present invention to screen for alleles 
of rhgl or RhgA using one or more of techniques disclosed herein or known in the art. 

The present invention also provides for parts of the plants produced by a method 
of the present invention. Plant parts, without limitation, include seed, endosperm, ovule 
and pollen. In a particularly preferred embodiment of the present invention, the plant part 
is a seed. 

Plants or parts thereof produced by a method of the present invention may be 
grown in culture and regenerated. Methods for the regeneration of soybean plants from 
various tissue types and methods for the tissue culture of soybean are known in the art 
(See, for example, Widholm et al, In Vitro Selection and Culture-induced Variation in 
Soybean, In Soybean: Genetics, Molecular Biology and Biotechnology, Eds. Verma et al, 
CAB International, Wallingford, Oxon, England (1996)). Regeneration techniques for 
plants such as soybean can use as the starting material a variety of tissue or cell types. 
With soybean in particular, regeneration processes have been developed that begin with 
certain differentiated tissue types such as meristems, Cartha et al, Can. J. Bot. 59:1671- 
1679 (1981), hypocotyl sections, Cameya et al, Plant Science Letters 21: 289-294 
(1981), and stem node segments, Saknetal, Plant Science Letters, 19: 193-201 (1980); 
Cheng et al, Plant Science Letters, 19: 91-99 (1980). Regeneration of whole sexually 
mature soybean plants from somatic embryos generated from explants of immature 
soybean embryos has been reported (Ranch et al., In Vitro Cellular & Developmental 
Biology 21: 653-658 (1985). Regeneration of mature soybean plants from tissue culture 
by organogenesis and embryogenesis has also been reported (Barwale et al, Planta 167: 
473-481 (1986); Wright et al, Plant Cell Reports 5: 150-154 (1986)). 
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Agents 

One skilled in the art can refer to general reference texts for detailed descriptions 
of known techniques discussed herein or equivalent techniques. These texts include 
Current Protocols in Molecular Biology Ausubel, et al, eds., John Wiley & Sons, N.Y. 
(1989), and supplements through September (1998), Molecular Cloning, A Laboratory 
Manual, Sambrook et al, 2 nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, New 
York (1989), Genome Analysis: A Laboratory Manual 1: Analyzing DNA, Birren et al, 
Cold Spring Harbor Press, Cold Spring Harbor, New York (1997); Genome Analysis: A 
Laboratory Manual 2: Detecting Genes, Birren et al, Cold Spring Harbor Press, Cold 
Spring Harbor, New York (1998); Genome Analysis: A Laboratory Manual 3: Cloning 
Systems, Birren et al, Cold Spring Harbor Press, Cold Spring Harbor, New York (1999); 
Genome Analysis: A Laboratory Manual 4: Mapping Genomes, Birren et al, Cold Spring 
Harbor Press, Cold Spring Harbor, New York (1999); Plant Molecular Biology: A 
Laboratory Manual, Clark, Springer-Verlag, Berlin, (1997), Methods in Plant Molecular 
Biology, Maliga et al, Cold Spring Harbor Press, Cold Spring Harbor, New York (1995). 
These texts can, of course, also be referred to in making or using an aspect of the 
invention. It is understood that any of the agents of the invention can be substantially 
purified and/or be biologically active and/or recombinant. 

(a) Nucleic Acid Molecules 

Nucleic acid molecules of the present invention include, without limitation, 
nucleic acid molecules having a nucleic acid sequence selected from the group consisting 
of SEQ ED NOs: 1-1096 and complements thereof. A subset of the nucleic acid 
molecules of the present invention includes nucleic acid molecules that encode a protein 
or fragment thereof. Another subset of the nucleic acid molecules of the present 
invention are cDNA molecules. Another subset of the nucleic acid molecules of the 
present invention includes nucleic acid molecules that are marker molecules. A further 
subset of the nucleic acid molecules of the present invention are those nucleic acid 
molecules having promoter sequences. 

Fragment nucleic acid molecules may comprise significant portion(s) of, or 
indeed most of, these nucleic acid molecules. In preferred embodiments, the fragments 
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may comprise smaller polynucleotides, e.g., oligonucleotides having from about 20 to 
about 250 nucleotide residues and more preferably, about 20 to about 100 nucleotide 
residues, or about 40 to about 60 nucleotide residues. In another preferred embodiment, 
fragment molecules may be at least 15 nucleotides, at least 30 nucleotides, at least 50 
nucleotides, or at least 100 nucleotides. 

The term "substantially purified," as used herein, refers to a molecule separated 
from substantially all other molecules normally associated with it in its native state. More 
preferably a substantially purified molecule is the predominant species present in a 
preparation. A substantially purified molecule may be greater than 60% free, preferably 
75% free, more preferably 90% free, and most preferably 95% free from the other 
molecules (exclusive of solvent) present in the natural mixture. The term "substantially 
purified" is not intended to encompass molecules present in their native state. 

The agents of the present invention will preferably be "biologically active" with 
respect to either a structural attribute, such as the capacity of a nucleic acid to hybridize to 
another nucleic acid molecule, or the ability of a protein to be bound by an antibody (or to 
compete with another molecule for such binding). Alternatively, such an attribute may be 
catalytic and thus involve the capacity of the agent to mediate a chemical reaction or 
response. 

The agents of the present invention may also be recombinant. As used herein, the 
term recombinant describes (a) nucleic acid molecules that are constructed or modified 
outside of cells and that can replicate or function in a living cell, (b) molecules that result 
from the transcription, replication or translation of recombinant nucleic acid molecules, or 
(c) organisms that contain recombinant nucleic acid molecules or are modified using 
recombinant nucleic acid molecules. 

It is understood that the agents of the present invention may be labeled with 
reagents that facilitate detection of the agent, e.g., fluorescent labels, (Prober et al, 
Science 238:336-340 (1987); Albarella et al, EP 144914), chemical labels, (Sheldon et 
al, U.S. Patent 4,582,789; Albarella et al, U.S. Patent 4,563,417), and modified bases, 
(Miyoshi et al, EP 119448) including nucleotides with radioactive elements, e.g., 32 P, 33 P, 
35 S or 123 I, such as 32 P dCTP. 
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It is further understood, that the present invention provides recombinant bacterial, 
animal, fungal and plant cells and viral constructs comprising the agents of the present 
invention. 

Nucleic acid molecules or fragments thereof of the present invention are capable 
of specifically hybridizing to other nucleic acid molecules under certain circumstances. 
As used herein, two nucleic acid molecules are said to be capable of specifically 
hybridizing to one another if the two molecules are capable of forming an anti-parallel, 
double-stranded nucleic acid structure. A nucleic acid molecule is said to be the 
"complement" of another nucleic acid molecule if they exhibit "complete 
complementarity," i.e., each nucleotide in one sequence is complementary to its base 
pairing partner nucleotide in another sequence. Two molecules are said to be "minimally 
complementary" if they can hybridize to one another with sufficient stability to permit 
them to remain annealed to one another under at least conventional "low-stringency" 
conditions. Similarly, the molecules are said to be "complementary" if they can hybridize 
to one another with sufficient stability to permit them to remain annealed to one another 
under conventional "high-stringency" conditions. Nucleic acid molecules which 
hybridize to other nucleic acid molecules, e.g., at least under low stringency conditions 
are said to be "hybridizable cognates" of the other nucleic acid molecules. Conventional 
stringency conditions are described by Sambrook et at, Molecular Cloning, A Laboratory 
Manual, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, New York (1989) and 
by Haymes et al, Nucleic Acid Hybridization, A Practical Approach, IRL Press, 
Washington, DC (1985). Departures from complete complementarity are therefore 
permissible, as long as such departures do not completely preclude the capacity of the 
molecules to form a double-stranded structure. Thus, in order for a nucleic acid molecule 
to serve as a primer or probe it need only be sufficiently complementary in sequence to be 
able to form a stable double-stranded structure under the particular solvent and salt 
concentrations employed. 

Appropriate stringency conditions which promote DNA hybridization, for 
example, 6.0 X sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash 
of 2.0 X SSC at 50°C, are known to those skilled in the art or can be found in Current 
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Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. For 
example, the salt concentration in the wash step can be selected from a low stringency of 
about 2.0 X SSC at 50°C to a high stringency of about 0.2 X SSC at 50°C. hi addition, 
the temperature in the wash step can be increased from low stringency conditions at room 
temperature, about 22°C, to high stringency conditions at about 65 °C. Both temperature 
and salt may be varied, or either the temperature or the salt concentration may be held 
constant while the other variable is changed. 

In a preferred embodiment, a nucleic acid of the present invention will 
specifically hybridize to one or more of the nucleic acid molecules set forth in SEQ ID 
NO: 1 through SEQ ID NO: 1096 or complements thereof under moderately stringent 
conditions, for example at about 2.0 X SSC and about 65°C. 

In a particularly preferred embodiment, a nucleic acid of the present invention 
will include those nucleic acid molecules that specifically hybridize to one or more of the 
nucleic acid molecules set forth in SEQ ID NO: 1 through SEQ ID NO: 1096 or 
complements thereof under high stringency conditions such as 0.2 X SSC and about 
65°C. 

In one aspect of the present invention, the nucleic acid molecules of the present 
invention comprise one or more of the nucleic acid sequences set forth in SEQ ID NO: 1 
through SEQ ED NO: 1096 or complements thereof or fragments of either. In another 
aspect of the present invention, one or more of the nucleic acid molecules of the present 
invention share at least 60% sequence identity with one or more of the nucleic acid 
sequences set forth in SEQ ID NO: 1 through SEQ ID NO: 1096 or complements thereof 
or fragments of either. In a further aspect of the present invention, one or more of the 
nucleic acid molecules of the present invention share at least 70% or more, e.g., at least 
80%, sequence identity with one or more of the nucleic acid sequences set forth in SEQ 
ID NO: 1 through SEQ ID NO: 1096 or complements thereof or fragments of either. In a 
more preferred aspect of the present invention, one or more of die nucleic acid molecules 
of the present invention share at least 90% or more, e.g., at least 95% and up to 100% 
sequence identity with one or more of the nucleic acid sequences set forth in SEQ ID NO: 
1 through SEQ ID NO: 1096 complements thereof or fragments of either. 
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As used herein "sequence identity" refers to the extent to which two optimally 
aligned polynucleotide or peptide sequences are invariant throughout a window of 
alignment of components, e.g., nucleotides or amino acids. An "identity fraction" for 
aligned segments of a test sequence and a reference sequence is the number of identical 
components which are shared by the two aligned sequences divided by the total number 
of components in reference sequence segment, i.e., the entire reference sequence or a 
smaller defined part of the reference sequence. "Percent identity" is the identity fraction 
times 100. 

Useful methods for determining sequence identity are disclosed in Guide to Huge 
Computers, Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo et al, 
SIAM J Applied Math 48: 1073 (1988). More particularly, preferred computer programs 
for determining sequence identity include the Basic Local Alignment Search Tool 
(BLAST) programs which are publicly available from National Center Biotechnology 
Information (NCBI) at the National Library of Medicine, National Institute of Health, 
Bethesda, Md. 20894; see BLAST Manual, Altschul et aL, NCBI, NLM, NIH; Altschul et 
al, J. Mol. Biol. 2/5:403-410 (1990); version 2.0 or higher of BLAST programs allows 
the introduction of gaps (deletions and insertions) into alignments; BLASTX can be used 
to determine sequence identity between a polynucleotide sequence query and a protein 
sequence database; and, BLASTN can be used to determine sequence identity between 
between sequences. 

For purposes of this invention "percent identity" shall be determined using 
BLASTX version 2.0.14 (default parameters), BLASTN version 2.0.14, or BLASTP 
2.0.14. 

A particularly preferred group of nucleic acid sequences are those present in the 
soybean insert of the clones set forth in table 6 below. 



TABLE 6 



Line 


Names of Clones Containing the Specified Gene 


Rh}>4 


rhgl/fcag 1 


r/^l/frag 2 


Forrest 


Forrest 1 


Forrest 7 


Forrestl3 


Peking 


Peking 1 


Peking 7 


Peking 13 


Pickett 


Pickett 1 


Pickett 7 


Pickett 13 


PI84751 


PI 84751.1 


PI 84751.7 


PI 84751.13 
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Line 


Names of Clones Containing the Specified Gene 


RhgA 


rhgl/fraz 1 


rhgl/tvag 2 


PI87631 


PI 87631.1 


PI 87631.7 


PI 87631.13 


PI87631-1 


PI 87631-1.1 




PI 87631-1.13 


PI88788R 


PI88788R.1 


PI 88788R.7 


PI88788R.13 


PI89772 






PI 89772.13 


PI90763 




PI 90763.7 


PI 90763.13 


PI200499 


PI 200499.1 


PI 200499.7 


PI 200499.13 


PI209332 


PI 209332.1 




PI 209332. 13 


PI404166 


PI 404166.1 


PI 404166.7 


PI 404166.13 


PI404198A 




PI404198A.7 


PI404198A.13 


PI404198B 


PI404198B.1 


PI 404198B.7 


PI404198B.13 


PI437654 


PI 437654.1 


PI 437654.7 


PI 437654.13 


PI438489B 


PI 438489.1 


PI 438489.7 


PI 438489B.13 


PI467312 


PI 467312.1 


PI 467312.7 


PI 467312.13 


PI507354 


PI 507354.1 


PI 507354.7 


PI 507354.13 


PI423871 


PI 423871.1 


PI 423871.7 


PI 423871.13 


PI407922 




PI 407922.7 


PI 407922.13 


PI360843 


PI 360843.1 


PI 360843.7 


PI 360843.13 


A2869 


A2869.1 


A 2869.7 


A2869.13 


A2069 


A2069.1 




A2069.13 


Jack 


JACK1 




JACK 13 


Will 


WILLI 


WILL.7 


WILL13 


Minsoy 


Minsoy 1 


Minsoy.7 


MINSOY 13 


Noir 


Noirl 


Noir.7 


NOIR 13 


Hutcheson 


Hutcheson 1 


Hutcheson.7 


Hutcheson. 13 


A1923 


A1923.1 


A1923.7 


A1923.13 


A2704 




A2704.7 


A2704.13 


Essex 


Essexl 


Essex.7 


ESSEX13 


A3244 


A3244.1 


A 3244.7 


A3244.13 


Lee74 


Lee74.1 


Lee74.7 


Lee74.13 


PI437654 




R107C17.7 


R107C17.13 



Table 5 shows clones comprising rhgl and RhgA sequences. The "Lines" column 
indicates the cultivar from which the sequence in the clone is derived. The RhgA, 
r/zgl/fragl, and /7zgl/frag2 columns show the clones derived from the lines that have the 
RhgA, rhgl fragment 1, or rhgl fragment 2, respectively. RhgA is amplified with SEQ ED 
NOs: 48 and 49, which produces a 3.5 kb product, rhgl fragment 1 is amplified with 
SEQ ID NOs: 24 and 25, which produces a 2.9 kb product, and rhgl fragment 2 is 
amplified with SEQ JJD NOs: 26 and 27, which produces a 1.75 kb product. All 
fragments are subcloned into a pCR4-TOPO vector. 
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(i) Nucleic Acid Molecules Encoding Proteins or Fragments Thereof 

A) rhgl 

The present invention includes nucleic acid molecules that code for an rhgl 
protein or fragment thereof. Examples of such nucleic acid molecules include those that 
code for the proteins set forth in SEQ ID NOs: 1097, 1100, 1098, 1101, and 1102-1115. 
Examples of illustrative fragment molecules include, without limitation, an extracellular 
LRR domain (rhgl, v.l, SEQ ID NO: 1097, residues 164-457; rhgl, v.2, SEQ ID NO: 
1098, residues 141-434), a transmembrane domain (rhgl, v.l, SEQ ID NO: 1097, 
residues 508-530; rhgl, v.2, SEQ ID NO: 1098, residues 33-51 and 485-507), and an 
STK domain (rhgl, v.l, SEQ ID NO: 1097, residues 578-869; rhgl, v.2, SEQ ID NO: 
1098, residues 555-846). Examples of illustrative nucleic acid molecules include SEQ ID 
NOs: 5, 6, 8-23, and 28-43. 

B) RhgA 

The present invention includes nucleic acid molecules that code for an RhgA 
protein or fragment thereof. Examples of such nucleic acid molecules include those that 
code for the proteins set forth in SEQ ID NOs: 1099 and 1 1 16-1 1 19. Examples of 
illustrative fragment molecules include, without limitation, an extracellular LRR domain 
(SEQ ID NO: 1099, residues 34-44), a transmembrane domain (SEQ ID NO: 1099, 
residues 449-471), and an STK domain (SEQ ID NO: 1099, residues 531-830). Examples 
of illustrative nucleic acid molecules include SEQ ID NOs: 7, 44-47, and 50-53. 

C) Rhgl and RhgA 

In another further aspect of the present invention, nucleic acid molecules of the 
present invention can comprise sequences which differ from those encoding a protein or 
fragment thereof in SEQ ID NO: 1097 through SEQ ID NO: 1 1 19 due to fact that the 
different nucleic acid sequence encodes a protein having one or more conservative amino 
acid changes. It is understood that codons capable of coding for such conservative amino 
acid substitutions are known in the art. 

It is well known in the art that one or more amino acids in a native sequence can 
be substituted with another amino acid(s), the charge and polarity of which are similar to 
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that of the native amino acid, i.e., a conservative amino acid substitution. Conserved 
substitutions for an amino acid within the native polypeptide sequence can be selected 
from other members of the class to which the naturally occurring amino acid belongs. 
Amino acids can be divided into the following four groups: (1) acidic amino acids, (2) 
basic amino acids, (3) neutral polar amino acids, and (4) neutral nonpolar amino acids. 
Representative amino acids within these various groups include, but are not limited to: (1) 
acidic (negatively charged) amino acids such as aspartic acid and glutamic acid; (2) basic 
(positively charged) amino acids such as arginine, histidine, and lysine; (3) neutral polar 
amino acids such as glycine, serine, threonine, cysteine, cystine, tyrosine, asparagine, and 
glutamine; and (4) neutral nonpolar (hydrophobic) amino acids such as alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and methionine. 

Conservative amino acid changes within the native polypeptides sequence can be 
made by substituting one amino acid within one of these groups with another amino acid 
within the same group. Biologically functional equivalents of the proteins or fragments 
thereof of the present invention can have ten or fewer conservative amino acid changes, 
more preferably seven or fewer conservative amino acid changes, and most preferably 
five or fewer conservative amino acid changes. The encoding nucleotide sequence will 
thus have corresponding base substitutions, permitting it to encode biologically functional 
equivalent forms of the proteins or fragments of the present invention. 

It is understood that certain amino acids may be substituted for other amino acids 
in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites on 
substrate molecules. Because it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence and, of course, its underlying DNA 
coding sequence and, nevertheless, obtain a protein with like properties. It is thus 
contemplated by the inventors that various changes may be made in the peptide sequences 
of the proteins or fragments of the present invention, or corresponding DNA sequences 
that encode said peptides, without appreciable loss of their biological utility or activity. It 
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is understood that codons capable of coding for such amino acid changes are known in the 
art. 

In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring interactive 
biological function on a protein is generally understood in the art (Kyte et al, J. Mol. 
Biol. 157, 105-132 (1982). It is accepted that the relative hydropathic character of the 
amino acid contributes to the secondary structure of the resultant protein, which in turn 
defines the interaction of the protein with other molecules, for example, enzymes, 
substrates, receptors, DNA, antibodies, antigens, and the like. In making such changes, 
the substitution of amino acids whose hydropathic indices are within ±2 is preferred, 
those which are within ±1 are particularly preferred, and those within +0.5 are even more 
particularly preferred. 

It is also understood in the art that the substitution of like amino acids can be 
made effectively on the basis of hydrophilicity. U.S. Patent 4,554,101, states that the 
greatest local average hydrophilicity of a protein, as govern by the hydrophilicity of its 
adjacent amino acids, correlates with a biological property of the protein. In a further 
aspect of the present invention, one or more of the nucleic acid molecules of the present 
invention differ in nucleic acid sequence from those encoding a peptide set forth in SEQ 
ID NO: 1097 through SEQ ID NO: 1 1 19 or fragment thereof due to the fact that one or 
more codons encoding an amino acid has been substituted for a codon that encodes a 
nonessential substitution of the amino acid originally encoded. 

Agents of the invention include nucleic acid molecules that encode at least about 
a contiguous 10 amino acid region of a protein of the present invention, more preferably 
at least about a contiguous 1 1 to 14 or larger amino acid region of a protein of the present 
invention. It is understood that the present invention includes nucleic acid molecules that 
specifically hybridize or exhibit a particular identity to the nucleic acid molecules 
described in (i). See (a) above. 
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(ii) Nucleic Acid Molecule Markers and Collections of Such Molecules 

One aspect of the present invention concerns nucleic acid molecules of the 
present invention that can act as markers. As used herein, a "marker" is an indicator for 
the presence of at least one phenotype or polymorphism, such as single nucleotide 
polymorphisms (SNPs), cleaveable amplified polymorphic sequences (CAPS), amplified 
fragment length polymorphisms (AFLPs), restriction fragment length polymorphisms 
(RFLPs), simple sequence repeats (SSRs), or random amplified polymorphic DNA 
(RAPDs). A "nucleic acid marker" as used herein means a nucleic acid molecule that is 
capable of being a marker for detecting a polymorphism or phenotype. 

In one embodiment of the present invention, the nucleic acid marker specifically 
hybridizes to a nucleic acid molecule having a nucleic acid sequence selected from the 
group SEQ NOs: 1-1096 and complements thereof. In a preferred embodiment, the 
nucleic acid marker is capable of detecting an rhgl SNP or INDEL set forth in table 2. In 
a preferred embodiment, the nucleic acid marker is capable of detecting an RghA SNP or 
INDEL set forth in table 4. In another preferred embodiment the nucleic acid marker is a 
nucleic acid molecule capable of acting as a PCR primer to amplify an rhgl or Rhg4 
coding region. Examples of such primers include, without limitation, nucleic acid 
molecules having a nucleic acid sequence set forth in SEQ ID NO: 401 -1096 and 
complements thereof. Such primers can be used in pairs to amplify a region (amplicons, 
e.g., without limitation, SEQ ID NOs: 54-400) that can be further investigated using 
techniques known in the art such as nucleic acid sequencing. Preferred pairs are those 
with identical "Seq ID" (see Description of the Sequence Listing) except for the fact that 
one "Seq ID" recites forward primer and one recites reverse primer. 

In another embodiment of the present invention, the nucleic acid marker 
specifically hybridizes to a nucleic acid molecule having a sequence that is present on 
linkage group G within 500 kb or lOOkb, more preferably within 50kb, even more 
preferably within 25kb of an rhgl allele, where the Rgh4 allele is preferably a sensitive 
allele, and more preferably a sensitive allele from A3244. In a preferred embodiment the 
nucleic acid marker specifically hybridizes to a nucleic acid molecule having a sequence 
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that is present on linkage group A2 within 500kb or lOOkb, more preferably within 50kb, 
even more preferably within 25kb of an RhgA allele, where the RghA allele is preferably a 
sensitive allele, and more preferably a sensitive allele from A3244. 

As used herein, a "collection of nucleic acid molecules" is a population of nucleic 
acid molecules where at least two, preferably all, of the nucleic acid molecules differ, at 
least in part, in their nucleic acid sequence. It is understood, that as used herein, an 
individual species within a collection of nucleic acid molecules may be physically 
separate or alternatively not physically separate from one or more other species within the 
collection of nucleic acid molecules. An example of a situation where individual species 
may be physically separate but considered a collection of nucleic acid molecules is where 
more than two species are present in a single location such as an array. 

As used herein, where a collection of nucleic acid molecules is a marker for a 
particular attribute, the level, pattern, occurrence and/or absence of the nucleic acid 
molecules associated with the attribute are not required to be the same between species of 
the collection. For example, the increase in the level of a species when in combination 
with the decrease in a second species could be diagnostic for a particular attribute. In a 
preferred embodiment of the present invention, the level, pattern, occurrence and/or 
absence of a nucleic acid molecule and/or collection of nucleic acid molecules of the 
present invention is a marker for SCN resistance. 

In one embodiment, the marker is any nucleic acid molecule that specifically 
hybridizes to any nucleic acid sequence set forth herein. In another embodiment, the 
marker is a marker capable of distinguishing among the haplotypes of either rhgl or 
RhgA. In yet another embodiment, more than one marker is used to simultaneously 
distinguish more than one haplotype. In a preferred embodiment, two, three, four, six, 
eight, twenty five or fifty or more nucleic acid markers are used simultaneously. In 
another embodiment, one or more markers that are capable of distinguishing among the 
haplotypes of rhgl and one or more markers that are capable of distinguishing among the 
haplotypes of RhgA are used together. 
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(iii) Nucleic acid molecules having promoter sequences and other 
regulatory sequences 

The present invention includes nucleic acid molecules that are an rhgl or RhgA 
promoter or fragment thereof. Examples of such nucleic acid molecules include those set 
forth in SEQ ID NO: 2, upstream of coordinate 45163 and SEQ ID NO: 3, upstream of 
coordinate 46798. As used herein a promoter is a nucleic acid sequence that when joined 
with a coding region is capable of expressing the protein or fragment thereof so encoded. 
In a preferred embodiment the promoter sequence corresponds to between 500 
nucleotides and 5,000 nucleotides or between 300 nucleotides and 700 nucleotides of the 
nucleic acid sequence set forth in SEQ ED NO: 2 between coordinates 45163 and 40163, 
or SEQ ID NO:3 between coordinates 46798 and 41798 , or the nucleic acid sequence set 
forth in SEQ ID NO: 4 between coordinates 11 1805 and 106805 Preferred partial 
promoter regions include the TATA box region, e.g. at coordinates 44234 through 44246 
of SEQ ID NO: 2 and at coordinates 107826 through 107835 of SEQ ID NO: 4, and 
CAAT box region, e.g. at coordinates 106243 through 106259 of SEQ ID NO: 4. 

Other regulatory sequences include introns or 3' untranslated regions (3'UTRs) 
associated with rhgl and RhgA. In a preferred embodiment, an intron is selected from a 
nucleic acid comprising a sequence selected from SEQ ED NO: 2 {rhgl v.l at coordinates 
45315-45449, 45510-46940, and 48764-48974), SEQ ED NO: 3 (rhgl v.2 at coordinates 
48764-48974) and SEQ ED NO: 4 (RhgA at coordinates 113969-114683). En another 
preferred embodiment, a 3 'ITER is located within 5,000 nucleotides, more preferable 
within 1000 nucleotides in the 3' direction of the last coding nucleotide of either rhgl or 
Rhg4 (SEQ ED NO: 2, rhgl v.l, coordinate 49573, SEQ ED NO: 3, rhgl, v.2, coordinate 
49573, SEQ ED NO: 4, RhgA, coordinate 115204). 

It is understood that the present invention includes nucleic acid molecules that 
specifically hybridize or exhibit a particular identity to the nucleic acid molecules 
described in (iii). See (a) above. 
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(b) Protein and Peptide Molecules 

A class of agents comprises one or more of the protein or peptide molecules 
encoded by SEQ ID NO: 1097 through SEQ ID NO: 11 19 or one or more of the protein or 
fragment thereof or peptide molecules encoded by other nucleic acid agents of the present 
invention. As used herein, the term "protein molecule" and "peptide molecule" mean any 
protein or protein fragment or peptide or polypeptide molecule that comprises ten or more 
amino acids, preferably at least 11 or 12 or more, more preferably at least 13 or 14 amino 
acids. It is well know in the art that proteins may undergo modification, including post- 
translational modifications, such as, but not limited to, disulfide bond formation, 
glycosylation, phosphorylation, or oligomerization. Thus, as used herein, the terms 
"protein molecule" and "peptide molecule" include molecules that are modified by any 
biological or non-biological process. The terms "amino acid" and "amino acids" refer to 
all naturally occurring L-amino acids. This definition is meant to include norleucine, 
ornithine, homocysteine, and homoserine. 

One or more of the protein or peptide molecules may be produced via chemical 
synthesis, or more preferably, by expression in a suitable bacterial or eukaryotic host. 
Suitable methods for expression are described by Sambrook, et al, (In: Molecular 
Cloning, A Laboratory Manual, 2nd Edition, Cold Spring Harbor Press, Cold Spring 
Harbor, New York (1989), or similar texts. 

Another class of agents comprise protein or peptide molecules encoded by SEQ 
ID NO: 1097 through SEQ ID NO: 1119 or complements thereof or, fragments or fusions 
thereof in which non-essential, or not relevant, amino acid residues have been added, 
replaced, or deleted. An example of such a homolog is a protein homolog of each 
soybean species, including but not limited to alfalfa, barley, Brassica, broccoli, cabbage, 
citrus, garlic, oat, oilseed rape, onion, canola, flax, pea, peanut, pepper, potato, rye, 
soybean, strawberry, sugarcane, sugarbeet, soybean, maize, rice, cotton, sorghum, 
Arabidopsis, wheat, pine, fir, eucalyptus, apple, lettuce, peas, lentils, grape, banana, tea, 
turf grasses, etc. Particularly preferred non- soybean plants to utilize for the isolation of 
homologs would include alfalfa, barley, oat, oilseed rape, canola, ornamentals, sugarcane, 
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sugarbeet, soybean, maize, rice, cotton, sorghum, Arabidopsis, wheat, potato, and turf 
grasses. Such a homolog can be obtained by any of a variety of methods. Most 
preferably, as indicated above, one or more of the disclosed sequences (SEQ ID NO: 1 
through SEQ ID NO: 1096 or complements thereof) will be used to define a pair of 
primers that may be used to isolate the protein homolog-encoding nucleic acid molecules 
from any desired species. Such molecules can be expressed to yield protein homologs by 
recombinant means. 

(c) Plant Constructs and Plant Transformants 

One or more of the nucleic acid molecules of the invention may be used in plant 
transformation or transfection. Exogenous genetic material may be transferred into a 
plant cell and the plant cell regenerated into a whole, fertile or sterile plant. Exogenous 
genetic material is any genetic material, whether naturally occurring or otherwise, from 
any source that is capable of being inserted into any organism. In a preferred 
embodiment the exogenous genetic material includes a nucleic acid molecule of the 
present invention, preferably a nucleic acid molecule having at least 20 nucleotides of a 
sequence selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO: 1096 
and complements thereof. In a preferred embodiment, the nucleic acid molecule codes 
for a protein or fragment thereof described in Section (i). In another preferred 
embodiment, the nucleic acid molecule is a promoter or fragment thereof described in 
Section (iii). 

Such genetic material may be transferred into either monocotyledons and 
dicotyledons including, but not limited to tomato, eggplant, maize, soybean, Arabidopsis, 
phaseolus, peanut, alfalfa, wheat, rice, oat, sorghum, rye, tritordeum, millet, fescue, 
perennial ryegrass, sugarcane, cranberry, papaya, banana, banana, muskmelon, apple, 
cucumber, dendrobium, gladiolus, chrysanthemum, liliacea, cotton, eucalyptus, 
sunflower, canola, turfgrass, sugarbeet, coffee and dioscorea (Christou, In: Particle 
Bombardment for Genetic Engineering of Plants, Biotechnology Intelligence Unit. 
Academic Press, San Diego, California (1996). 
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In a preferred embodiment, the genetic material is transferred to a soybean. 
Preferred soybeans to transfer an rhgl SCN resistance allele are selected from the group 
consisting of PI548402 (Peking), PI200499, A2869, Jack, A2069, PI209332 (No:4), 
PI404166 (Krasnoaarmejkaja), PI404198 (Sun huan do), PI437654 (Er-hej-jan), 
PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), PI548988 (Pickett), 
PI84751, PI437654, PI40792, Pyramid, Nathan, AG2201, A3469, AG3901, A3904, 
AG4301, AG4401, AG4501, AG4601, PION9492, PI88788, Dyer, Custer, Manokin, and 
Doles. 

Preferred soybeans to transfer an RhgA SCN resistance allele are selected from 
the group consisting of PI548402 (Peking), PI437654 (Er-hej-jan), PI438489 (Chiquita), 
PI507354 (Tokei 421), PI548655 (Forrest), PI548988 (Pickett), PI88788, PI404198 (Sun 
Huan Do), PI404166 (Krasnoaarmejkaja), Hartwig, Manokin, Doles, Dyer, and Custer. 

Transfer of a nucleic acid that encodes for a protein can result in overexpression 
of that protein in a transformed cell or transgenic plant. One or more of the proteins or 
fragments thereof encoded by nucleic acid molecules of the invention may be 
overexpressed in a transformed cell or transformed plant. Such overexpression may be 
the result of transient or stable transfer of the exogenous genetic material. Such 
overexpression can also result in SCN resistance to one or more races of SCN. 

Exogenous genetic material may be transferred into a host cell by the use of a 
DNA vector or construct designed for such a purpose. Design of such a vector is 
generally within the skill of the art {See, Plant Molecular Biology: A Laboratory Manual, 
Clark (ed.), Springier, New York (1997). 

A construct or vector may include a plant promoter to express the protein or 
protein fragment of choice. A number of promoters, which are active in plant cells, have 
been described in the literature. These include the nopaline synthase (NOS) promoter 
(Ebert et ah, Proc. Natl. Acad. Sci. (U.S.A.) 84:5745-5749 (1987), the octopine synthase 
(OCS) promoter (which are carried on tumor-inducing plasmids of Agrobacterium 
tumefaciens), the caulimovirus promoters such as the cauliflower mosaic virus (CaMV) 
19S promoter (Lawton et al, Plant Mol. Biol. 9:315-324 (1987), and the CaMV 35S 
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promoter (Odell et al, Nature 313:810-812 (1985), the figwort mosaic virus 35S- 
promoter, the light-inducible promoter from the small subunit of ribulose-l,5-bis- 
phosphate carboxylase (ssRUBISCO), the Adh promoter (Walker et al, Proc. Natl. Acad. 
Sci. (U.S.A.) 54:6624-6628 (1987), the sucrose synthase promoter (Yang et al, Proc. 
Natl Acad. Sci. (U.S.A.) #7:4144-4148 (1990), the R gene complex promoter (Chandler et 
al, The Plant Cell 7:1175-1183 (1989), and the chlorophyll a/b binding protein gene 
promoter, etc. These promoters have been used to create DNA constructs that have been 
expressed in plants; see, e.g., PCT publication WO 84/02913. The CaMV 35S promoters 
are preferred for use in plants. Promoters known or found to cause transcription of DNA 
in plant cells can be used in the invention. 

For the purpose of expression in source tissues of the plant, such as the leaf, seed, 
root or stem, it is preferred that the promoters utilized have relatively high expression in 
these specific tissues. Tissue-specific expression of a protein of the present invention is a 
particularly preferred embodiment. For this purpose, one may choose from a number of 
promoters for genes with tissue- or cell-specific or -enhanced expression. Examples of 
such promoters reported in the literature include the chloroplast glutamine synthetase GS2 
promoter from pea (Edwards et al, Proc. Natl. Acad. Sci. (U.S.A.) 87:3459-3463 (1990), 
the chloroplast fructose-l,6-biphosphatase (FBPase) promoter from wheat (Lloyd et al, 
Mol. Gen. Genet. 225:209-216 (1991), the nuclear photosynthetic ST-LS1 promoter from 
potato (Stockhaus et al, EMBO J. 8:2445-2451 (1989), the STK (PAL) promoter and the 
glucoamylase (CHS) promoter from Arabidopsis thaliana. Also reported to be active in 
photosynthetically active tissues are the ribulose-l,5-bisphosphate carboxylase (RbcS) 
promoter from eastern larch (Larix laricina), the promoter for the cab gene, cab6, from 
pine (Yamamoto et al, Plant Cell Physiol. 35:773-778 (1994), the promoter for the Cab-1 
gene from wheat (Fejes et al, Plant Mol. Biol. 15:921-932 (1990), the promoter for the 
CAB-1 gene from spinach (Lubberstedt et al., Plant Physiol. 104:997-1006 (1994), the 
promoter for the cablR gene from rice (Luan et al, Plant Cell. 4:971-981 (1992), the 
pyruvate, orthophosphate dikinase (PPDK) promoter from maize (Matsuoka et al, Proc. 
Natl. Acad. Sci. (U.S.A.) 90: 9586-9590 (1993), the promoter for the tobacco Lhcbl*2 



88 



WO 01/51627 



PCT7US01/00552 



gene (Cerdan et al, Plant Mol. Biol. 33:245-255 (1997), the Arabidopsis thaliana SUC2 
sucrose-H+ symporter promoter (Truernit et al., Planta. 795:564-570 (1995), and the 
promoter for the thylakoid membrane proteins from spinach (psaD, psaF, psaE, PC, FNR, 
atpC, atpD, cab, rbcS). Other promoters for the chlorophyll a/b-binding proteins may also 
be utilized in the invention, such as the promoters for LhcB gene and PsbP gene from 
white mustard (Sinapis alba; Eretsch et al., Plant Mol. Biol. 25:219-229 (1995)). 

For the purpose of expression in sink tissues of the plant, such as the tuber of the 
potato plant, the fruit of tomato, or the seed of maize, wheat, rice and barley, it is 
preferred that the promoters utilized in the invention have relatively high expression in 
these specific tissues. A number of promoters for genes with tuber-specific or -enhanced 
expression are known, including the class I patatin promoter (Bevan et al., EMBO J. 
8: 1899-1906 (1986); Jefferson et al, Plant Mol. Biol. 74:995-1006 (1990)), the promoter 
for the potato tuber ADPGPP genes, both the large and small subunits, the sucrose 
synthase promoter (Salanoubat et al, Gene 60:47-56 (1987), Salanoubat et al, Gene 
84: 181-185 (1989)), the promoter for the major tuber proteins including the 22 kd protein 
complexes and proteinase inhibitors (Hannapel, Plant Physiol. 101:703-704 (1993)), the 
promoter for the granule bound starch synthase gene (GBSS) (Visser et al, Plant Mol. 
Biol. 77:691-699 (1991)), and other class I and II patatins promoters (Koster-Topfer et al, 
Mol Gen Genet. 279:390-396 (1989); Mignery et al, Gene. 62:27-44 (1988)). 

Other promoters can also be used to express a protein or fragment thereof in 
specific tissues, such as seeds or fruits. The promoter for P-conglycinin (Chen et al, Dev. 
Genet. 10: 112-122 (1989)) or other seed-specific promoters such as the napin and 
phaseolin promoters, can be used. The zeins are a group of storage proteins found in 
maize endosperm. Genomic clones for zein genes have been isolated (Pedersen et al. , 
Cell 29:1015-1026 (1982)) and the promoters from these clones, including the 15 kD, 16 
kD, 19 kD, 22 kD, 27 kD and genes, could also be used. Other promoters known to 
function, for example, in maize include the promoters for the following genes: waxy, 
Brittle, Shrunken 2, Branching enzymes I and U, starch synthases, debranching enzymes, 
oleosins, glutelins and sucrose synthases. A particularly preferred promoter for maize 
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endosperm expression is the promoter for the glutelin gene from rice, more particularly 
the Osgt-1 promoter (Zheng et al, Mol. Cell Biol. 75:5829-5842 (1993)). Examples of 
promoters suitable for expression in wheat include those promoters for the ADPglucose 
pyrosynthase (ADPGPP) subunits, the granule bound and other starch synthase, the 
branching and debranching enzymes, the embryogenesis-abundant proteins, the gliadins 
and the glutenins. Examples of such promoters in rice include those promoters for the 
ADPGPP subunits, the granule bound and other starch synthase, the branching enzymes, 
the debranching enzymes, sucrose synthases and the glutelins. A particularly preferred 
promoter is the promoter for rice glutelin, Osgt-1. Examples of such promoters for barley 
include those for the ADPGPP subunits, the granule bound and other starch synthase, the 
branching enzymes, the debranching enzymes, sucrose synthases, the hordeins, the 
embryo globulins and the aleurone specific proteins. 

Root specific promoters may also be used. An example of such a promoter is the 
promoter for the acid chitinase gene (Samac et al, Plant Mol. Biol. 25:587-596 (1994)). 
Expression in root tissue could also be accomplished by utilizing the root specific 
subdomains of the CaMV35S promoter that have been identified (Lam et al, Proc. Natl. 
Acad. Sci. (U.S.A.) 56:7890-7894 (1989)). Other root cell specific promoters include 
those reported by Conkling et al. (Conkling et ah, Plant Physiol. 93: 1203-121 1 (1990)). 

Additional promoters that may be utilized are described, for example, in U.S. 
Patent Nos. 5,378,619; 5,391,725; 5,428,147; 5,447,858; 5,608,144; 5,608,144; 
5,614,399; 5,633,441; 5,633,435; and 4,633,436. In addition, a tissue specific enhancer 
may be used (Fromm et al, The Plant Cell 7:977-984 (1989)). 

Preferred promoters are those set forth in Section (a)(iii) of Agents. 

Constructs or vectors may also include, with the coding region of interest, a 
nucleic acid sequence that acts, in whole or in part, to terminate transcription of that 
region. A number of such sequences have been isolated, including the Tr7 3' sequence 
and the NOS 3' sequence (Ingelbrecht et al, Tlie Plant Cell 7:671-680 (1989); Bevan et 
al, Nucleic Acids Res. 77:369-385 (1983)). 
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A vector or construct may also include regulatory elements. Examples of such 
include the Adh intron 1 (Callis et al, Genes and Develop. 1:1183-1200 (1987)), the 
sucrose synthase intron (Vasil et al, Plant Physiol. 97:1575-1579 (1989)) and the TMV 
omega element (Gallie et al., The Plant Cell 1:301-311 (1989)). These and other 
regulatory elements may be included when appropriate. 

A vector or construct may also include a selectable marker. Selectable markers 
may also be used to select for plants or plant cells that contain the exogenous genetic 
material. Examples of such include, but are not limited to: a neomycin 
phosphotransferase gene (U.S. Patent 5,034,322), which codes for kanamycin resistance 
and can be selected for using kanamycin, G418, etc.; a bar gene which codes for 
bialaphos resistance; genes which encode glyphosate resistance (U.S. Patents 4,940,835; 
5,188,642; 4,971,908; 5,627,061); a nitrilase gene which confers resistance to bromoxynil 
(Stalker et al, J. Biol Chem. 263:6310-6314 (1988)); a mutant acetolactate synthase gene 
(ALS) which confers imidazolinone or sulphonylurea resistance (European Patent 
Application 154,204 (Sept. 11, 1985)); and a methotrexate resistant DHFR gene (Thillet 
et al, J. Biol Chem. 263:12500-12508 (1988)). 

A vector or construct may also include DNA sequence which encodes a transit 
peptide. Incorporation of a suitable chloroplast transit peptide may also be employed 
(European Patent Application Publication Number 0218571). Translational enhancers 
may also be incorporated as part of the vector DNA. DNA constructs could contain one 
or more 5' non-translated leader sequences which may serve to enhance expression of the 
gene products from the resulting mRNA transcripts. Such sequences may be derived 
from the promoter selected to express the gene or can be specifically modified to increase 
translation of the mRNA. Such regions may also be obtained from viral RNAs, from 
suitable eukaryotic genes, or from a synthetic gene sequence. For a review of optimizing 
expression of transgenes, see Koziel et al, Plant Mol. Biol. 32:393-405 (1996). 

A vector or construct may also include a screenable marker. Screenable markers 
may be used to monitor expression. Exemplary screenable markers include: a p~ 
glucuronidase or uidA gene (GUS) which encodes an enzyme for which various 
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chromogenic substrates are known (Jefferson, Plant Mol. Biol, Rep. 5:387-405 (1987); 
Jefferson et al, EMBO J. 6:3901-3907 (1987)); an R-locus gene, which encodes a 
product that regulates the production of anthocyanin pigments (red color) in plant tissues 
(Dellaporta et al, Stadler Symposium ii:263-282 (1988)); a p-lactamase gene (Sutchffe 
et al, Proc. Natl. Acad. Sci. (U.SA.) 75:3737-3741 (1978)), a gene which encodes an 
enzyme for which various chromogenic substrates are known (e.g., PAD AC, a 
chromogenic cephalosporin); a luciferase gene (Ow et al, Science 234:856-859 (1986)); a 
xylE gene (Zukowsky et al, Proc. Natl. Acad. Sci. (U.SA.) 80: 1101-1 105 (1983)) which 
encodes a catechol dioxygenase that can convert chromogenic catechols; an a-amylase 
gene (Ikatu et al, Bio/Technol. 8:241-242 (1990)); a tyrosinase gene (Katz et al, J. Gen. 
Microbiol. 129:2103-2114 (1983)) which encodes an enzyme capable of oxidizing 
tyrosine to DOPA and dopaquinone which in turn condenses to melanin; an a- 
galactosidase, which will turn a chromogenic a-galactose substrate. 

Included within the terms "selectable or screenable marker genes" are also genes 
which encode a secretable marker whose secretion can be detected as a means of 
identifying or selecting for transformed cells. Examples include markers which encode a 
secretable antigen that can be identified by antibody interaction, or even secretable 
enzymes which can be detected catalytically. Secretable proteins fall into a number of 
classes, including small, diffusible proteins which are detectable, (e.g., by ELISA), small 
active enzymes which are detectable in extracellular solution (e.g., a-amylase, (3- 
lactamase, phosphinothricin transferase), or proteins which are inserted or trapped in the 
cell wall (such as proteins which include a leader sequence such as that found in the 
expression unit of extension or tobacco PR-S). Other possible selectable and/or 
screenable marker genes will be apparent to those of skill in the art. 

There are many methods for introducing transforming nucleic acid molecules into 
plant cells. Suitable methods are believed to include virtually any method by which 
nucleic acid molecules may be introduced into a cell, such as by Agrobacterium infection 
or direct delivery of nucleic acid molecules such as, for example, by PEG-mediated 
transformation, by electroporation or by acceleration of DNA coated particles, etc 
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(Potrykus, Ann. Rev. Plant Physiol. Plant Mol. Biol. 42:205-225 (1991); Vasil, Plant Mol. 
Biol. 25:925-937 (1994)). For example, electroporation has been used to transform maize 
protoplasts (Fromm et al, Nature 572:791-793 (1986)). 

Other vector systems suitable for introducing transforming DNA into a host plant 
cell include but are not limited to binary artificial chromosome (BIBAC) vectors 
(Hamilton et al, Gene 200:107-116 (1997)); and transfection with RNA viral vectors 
(Della-Cioppa et al, Ann. NY. Acad. Sci. (1996), 792 (Engineering Plants for 
Commercial Products and Applications), 57-61). Additional vector systems also include 
plant selectable YAC vectors such as those described in Mullen et al, Molecular 
Breeding 4:449-457 (1988)). 

Technology for introduction of DNA into cells is well known to those of skill in 
the art. Four general methods for delivering a gene into cells have been described: (1) 
chemical methods (Graham et al, Virology 54:536-539 (1973)); (2) physical methods 
such as microinjection (Capecchi, Cell 22:479-488 (1980)), electroporation (Wong et al, 
Biochem. Biophys. Res. Commun. 707:584-587 (1982); Fromm et al, Proc. Natl. Acad. 
Sci. (U.S.A.) 82:5824-5828 (1985); U.S. Patent No. 5,384,253); and the gene gun 
(Johnston et al, Methods Cell Biol 45:353-365 (1994)); (3) viral vectors (Clapp, Clin. 
Perinatol 20:155-168 (1993); Lu et al, J. Exp. Med. 778:2089-2096 (1993); Eglitis et 
al, Biotechniques (5:608-614 (1988)); and (4) receptor-mediated mechanisms (Curiel et 
al, Hum. Gen. Ther. 5:147-154 (1992), Wagner etal, Proc. Natl. Acad. Sci. (USA) 
89:6099-6103 (1992)). 

Acceleration methods that may be used include, for example, microprojectile 
bombardment and the like. One example of a method for delivering transforming nucleic 
acid molecules to plant cells is microprojectile bombardment. This method has been 
reviewed by Yang et al. (eds.), Particle Bombardment Technology for Gene Transfer, 
Oxford Press, Oxford, England (1994)). Non-biological particles (microprojectiles) that 
may be coated with nucleic acids and delivered into cells by a propelling force. 
Exemplary particles include those comprised of tungsten, gold, platinum and the like. 
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A particular advantage of microprojectile bombardment, in addition to it being an 
effective means of reproducibly transforming monocots, is that neither the isolation of 
protoplasts (Cristou et al, Plant Physiol. 87:611-61 A (1988)) nor the susceptibility of 
Agrobacterium infection are required. An illustrative embodiment of a method for 
delivering DNA into maize cells by acceleration is a biolistics a-particle delivery system, 
which can be used to propel particles coated with DNA through a screen, such as a 
stainless steel or Nytex screen, onto a filter surface covered with corn cells cultured in 
suspension. Gordon-Kamm et al, describes the basic procedure for coating tungsten 
particles with DNA (Gordon-Kamm et al, Plant Cell 2:603-618 (1990)). The screen 
disperses the tungsten nucleic acid particles so that they are not delivered to the recipient 
cells in large aggregates. A particle delivery system suitable for use with the invention is 
the helium acceleration PDS-1000/He gun is available from Bio-Rad Laboratories (Bio- 
Rad, Hercules, California)(Sanford et al, Technique 3:3-16 (1991)). 

For the bombardment, cells in suspension may be concentrated on filters. Filters 
containing the cells to be bombarded are positioned at an appropriate distance below the 
microprojectile stopping plate. If desired, one or more screens are also positioned 
between the gun and the cells to be bombarded. 

Alternatively, immature embryos or other target cells may be arranged on solid 
culture medium. The cells to be bombarded are positioned at an appropriate distance 
below the microprojectile stopping plate. If desired, one or more screens are also 
positioned between the acceleration device and the cells to be bombarded. Through the 
use of techniques set forth herein one may obtain up to 1000 or more foci of cells 
transiently expressing a screenable or selectable marker gene. The number of cells in a 
focus which express the exogenous gene product 48 hours post-bombardment often range 
from one to ten and average one to three. 

In bombardment transformation, one may optimize the pre-bombardment 
culturing conditions and the bombardment parameters to yield the maximum numbers of 
stable transformants. Both the physical and biological parameters for bombardment are 
important in this technology. Physical factors are those that involve manipulating the 
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DNA/microprojectile precipitate or those that affect the flight and velocity of either the 
macro- or microprojectiles. Biological factors include all steps involved in manipulation 
of cells before and immediately after bombardment, the osmotic adjustment of target cells 
to help alleviate the trauma associated with bombardment and also the nature of the 
transforming DNA, such as linearized DNA or intact supercoiled plasmids. It is believed 
that pre-bombardment manipulations are especially important for successful 
transformation of immature embryos. 

In another alternative embodiment, plastids can be stably transformed. Methods 
disclosed for plastid transformation in higher plants include the particle gun delivery of 
DNA containing a selectable marker and targeting of the DNA to the plastid genome 
through homologous recombination (Svab etal, Proc. Natl. Acad. Sci. (U.S.A.) 87:8526- 
8530 (1990); Svab et al, Proc. Natl Acad. Sci. (U.S.A.) 90:913-917 (1993); Staub et al, 
EMBO J. 12:601-606 (1993); U.S. Patents 5, 451,513 and 5,545,818). 

Accordingly, it is contemplated that one may wish to adjust various aspects of the 
bombardment parameters in small-scale studies to fully optimize the conditions. One 
may particularly wish to adjust physical parameters such as gap distance, flight distance, 
tissue distance and helium pressure. One may also minimize the trauma reduction factors 
by modifying conditions which influence the physiological state of the recipient cells and 
which may therefore influence transformation and integration efficiencies. For example, 
the osmotic state, tissue hydration and the subculture stage or cell cycle of the recipient 
cells may be adjusted for optimum transformation. The execution of other routine 
adjustments will be known to those of skill in the art in light of the present disclosure. 

Agrobacterium-mediated transfer is a widely applicable system for introducing 
genes into plant cells because the DNA can be introduced into whole plant tissues, 
thereby bypassing the need for regeneration of an intact plant from a protoplast. The use 
of Agrobacterium-mediatQd plant integrating vectors to introduce DNA into plant cells is 
well known in the art. See, for example the methods described by Fraley et al, 
Bio/Technology 3:629-635 (1985) and Rogers et al, Methods Enzymol 755:253-277 
(1987). Further, the integration of the T-DNA is a relatively precise process resulting in 
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few rearrangements. The region of DNA to be transferred is defined by the border 
sequences and intervening DNA is usually inserted into the plant genome as described 
(Spielmann et al, Mol. Gen. Genet. 205:34 (1986)). 

Modern Agrobacterium transformation vectors are capable of replication in E. 
coli as well as Agrobacterium, allowing for convenient manipulations as described (Klee 
et al, In: Plant DNA Infectious Agents, Hohn et al. (eds.), Springer-Verlag, New York, 
pp. 179-203 (1985)). Moreover, technological advances in vectors for Agrobacterium- 
mediated gene transfer have improved the arrangement of genes and restriction sites in 
the vectors to facilitate construction of vectors capable of expressing various polypeptide 
coding genes. The vectors described have convenient multi-linker regions flanked by a 
promoter and a polyadenylation site for direct expression of inserted polypeptide coding 
genes and are suitable for present purposes (Rogers et al, Methods Enzymol. 753:253-277 
(1987)). In addition, Agrobacterium containing both armed and disarmed Ti genes can be 
used for the transformations, hi those plant strains where Agrobacterium-msdiated 
transformation is efficient, it is the method of choice because of the facile and defined 
nature of the gene transfer. 

A transgenic plant formed using Agrobacterium transformation methods typically 
contains a single gene on one chromosome. Such transgenic plants can be referred to as 
being heterozygous for the added gene. More preferred is a transgenic plant that is 
homozygous for the added structural gene; i.e., a transgenic plant that contains two added 
genes, one gene at the same locus on each chromosome of a chromosome pair. A 
homozygous transgenic plant can be obtained by sexually mating (selfmg) an independent 
segregant transgenic plant that contains a single added gene, germinating some of the 
seed produced and analyzing the resulting plants produced for the gene of interest. 

It is also to be understood that two different transgenic plants can also be mated 
to produce offspring that contain two independently segregating, exogenous genes. 
Selfing of appropriate progeny can produce plants that are homozygous for both added, 
exogenous genes that encode a polypeptide of interest. Backcrossing to a parental plant 
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and out-crossing with a non-transgenic plant are also contemplated, as is vegetative 
propagation. 

Transformation of plant protoplasts can be achieved using methods based on 
calcium phosphate precipitation, polyethylene glycol treatment, electroporation and 
combinations of these treatments (See, for example, Potrykus et al, Mol. Gen. Genet. 
205:193-200 (1986); Lorz et al, Mol. Gen. Genet. 199:11% (1985); Fromm et al, Nature 
319:191 (1986); Uchimiya et al, Mol. Gen. Genet. 204:204 (1986); Marcotte et al, 
Nature 335:454-457 (1988)). 

Application of these systems to different plant strains depends upon the ability to 
regenerate that particular plant strain from protoplasts. Illustrative methods for the 
regeneration of cereals from protoplasts are described (Fujimura et al, Plant Tissue 
Culture Letters 2:1 A (1985); Toriyama et al, TheorAppl. Genet. 205:34 (1986); Yamada 
et al, Plant Cell Rep. 4:S5 (1986); Abdullah et al, Biotechnology 4: 1087 (1986)). 

To transform plant strains that cannot be successfully regenerated from 
protoplasts, other ways to introduce DNA into intact cells or tissues can be utilized. For 
example, regeneration of cereals from immature embryos or explants can be effected as 
described (Vasil, Biotechnology 6:391 (1988)). In addition, "particle gun" or high- 
velocity microprojectile technology can be utilized (Vasil et al, Bio/Technology 10:661 
(1992)). 

Using the latter technology, DNA is carried through the cell wall and into the 
cytoplasm on the surface of small metal particles as described (Klein et al, Nature 
328:10 (1987); Klein et al, Proc. Natl. Acad. Sci. (U.S.A.) §5:8502-8505 (1988); 
McCabe et al, Bio/Technology 6:923 (1988)). The metal particles penetrate through 
several layers of cells and thus allow the transformation of cells within tissue explants. 

The regeneration, development and cultivation of plants from single plant 
protoplast transformants or from various transformed explants are well known in the art 
(Weissbach et al, In: Methods for Plant Molecular Biology , Academic Press, San Diego, 
CA, (1988)). This regeneration and growth process typically includes the steps of 
selection of transformed cells, culturing those individualized cells through the usual 
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stages of embryonic development through the rooted plantlet stage. Transgenic embryos 
and seeds are similarly regenerated. The resulting transgenic rooted shoots are thereafter 
planted in an appropriate plant growth medium such as soil. 

The development or regeneration of plants containing the foreign, exogenous 
gene that encodes a protein of interest is well known in the art. Preferably, the 
regenerated plants are self-pollinated to provide homozygous transgenic plants. 
Otherwise, pollen obtained from the regenerated plants is crossed to seed-grown plants of 
agronomically important lines. Conversely, pollen from plants of these important lines is 
used to pollinate regenerated plants. A transgenic plant of the invention containing a 
desired polypeptide is cultivated using methods well known to one skilled in the art. 

There are a variety of methods for the regeneration of plants from plant tissue. 
The particular method of regeneration will depend on the starting plant tissue and the 
particular plant species to be regenerated. 

Methods for transforming dicots, primarily by use of Agrobacteriurn tumefaciens 
and obtaining transgenic plants have been published for cotton (U.S. Patent No. 
5,004,863; U.S. Patent No. 5,159,135; U.S. Patent No. 5,518,908); soybean (U.S. Patent 
No. 5,569,834; U.S. Patent No. 5,416,011; McCabe et al, Biotechnology 6:923 (1988); 
Christou et al, Plant Physiol. 87:611-61 A (1988)); Brassica (U.S. Patent No. 5,463,174); 
peanut (Cheng et al, Plant Cell Rep. 15:653-657 (1996), McKently et al, Plant Cell Rep. 
14:699-103 (1995)); papaya; and pea (Grant et al, Plant Cell Rep. 75:254-258 (1995)). 

Transformation of monocotyledons using electroporation, particle bombardment 
and Agrobacteriurn have also been reported. Transformation and plant regeneration have 
been achieved in asparagus (Bytebier et al, Proc. Natl. Acad. Sci. (USA) 84:5354 
(1987)); barley (Wan et al, Plant Physiol 104:31 (1994)); maize (Rhodes etal, Science 
240:204 (1988); Gordon-Kamm et al, Plant Cell 2:603-618 (1990); Fromm et al, 
Bio/Technology 5:833 (1990); Koziel etal, Bio/Technology 11:194 (1993); Armstrong et 
al, Crop Science 35:550-557 (1995)); oat (Somers et al, Bio/Technology 10:1589 
(1992)); orchard grass (Horn et al, Plant Cell Rep. 7:469 (1988)); rice (Toriyama et al, 
TheorAppl. Genet. 205:34 (1986); Part etal, Plant Mol. Biol. 32:1135-1148 (1996); 
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Abedinia et al, Aust. J. Plant Physiol. 24:133-141 (1997); Zhang et al, Theor. Appl. 
Genet. 76:835 (1988); Zhang et al, Plant Cell Rep. 7:379 (1988); Battraw et al, Plant 
Sci. 86: 191-202 (1992); Christou et al., Bio/Technology 9:957 (1991)); rye (De la Pena et 
al, Nature 325:21 A (1987)); sugarcane (Bower et al., Plant J. 2:409 (1992)); tall fescue 
(Wang et al, Bio/Technology 70:691 (1992)) and wheat (Vasil et al, Bio/Technology 
10:667 (1992); U.S. Patent No. 5,631,152). 

Assays for gene expression based on the transient expression of cloned nucleic 
acid constructs have been developed by introducing the nucleic acid molecules into plant 
cells by polyethylene glycol treatment, electroporation, or particle bombardment 
(Marcotte et al, Nature 335:454-457 (1988); Marcotte et al, Plant Cell 1. "523-532 
(1989); McCarty et al, Cell 66:895-905 (1991); Hattori et al, Genes Dev. 6:609-618 
(1992); Goff et al, EMBO J. 9:2517-2522 (1990)). Transient expression systems may be 
used to functionally dissect gene constructs (see generally, Mailga et al, Methods in 
Plant Molecular Biology, Cold Spring Harbor Press (1995)). 

Any of the nucleic acid molecules of the invention may be introduced into a plant 
cell in a permanent or transient manner in combination with other genetic elements such 
as vectors, promoters, enhancers, etc. Further, any of the nucleic acid molecules of the 
invention may be introduced into a plant cell in a manner that allows for overexpression 
of the protein or fragment thereof encoded by the nucleic acid molecule. 

Cosuppression is the reduction in expression levels, usually at the level of RNA, 
of a particular endogenous gene or gene family by the expression of a homologous sense 
construct that is capable of transcribing mRNA of the same strandedness as the transcript 
of the endogenous gene (Napoli et al, Plant Cell 2:279-289 (1990); van der Krol et al, 
Plant Cell 2:291-299 (1990)). Cosuppression may result from stable transformation with 
a single copy nucleic acid molecule that is homologous to a nucleic acid sequence found 
within the cell (Prolls et al, Plant J. 2:465-475 (1992)) or with multiple copies of a 
nucleic acid molecule that is homologous to a nucleic acid sequence found within the cell 
(Mittlesten et al, Mol. Gen. Genet. 244:325-330 (1994)). Genes, even though different, 
linked to homologous promoters may result in the cosuppression of the linked genes 
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(Vaucheret, C.R. Acad. Set 7/7376:1471-1483 (1993); Flavell, Proc. Natl. Acad. Sci. 
(U.S.A.) 97:3490-3496 (1994)); van Blokland et al, Plant J. 6:861-877 (1994); 
Jorgensen, Trends Biotechnol. 5:340-344 (1990); Meins et al, In: Gene Inactivation and 
Homologous Recombination in Plants, Paszkowski (ed.), pp. 335-348, Kluwer Academic, 
Netherlands (1994)). 

It is understood that one or more of the nucleic acids of the invention may be 
introduced into a plant cell and transcribed using an appropriate promoter with such 
transcription resulting in the cosuppression of an endogenous protein. 

Antisense approaches are a way of preventing or reducing gene function by 
targeting the genetic material (U.S. Patents 4,801,540 and 5,107,065 Mol et al, FEBS 
Lett. 268:427-430 (1990)). The objective of the antisense approach is to use a sequence 
complementary to the target gene to block its expression and create a mutant cell line or 
organism in which the level of a single chosen protein is selectively reduced or abolished. 
Antisense techniques have several advantages over other 'reverse genetic' approaches. 
The site of inactivation and its developmental effect can be manipulated by the choice of 
promoter for antisense genes or by the timing of external application or microinjection. 
Antisense can manipulate its specificity by selecting either unique regions of the target 
gene or regions where it shares homology to other related genes (Hiatt et al, In: Genetic 
Engineering, Setlow (ed.), Vol. 11, New York: Plenum 49-63 (1989)). 

The principle of regulation by antisense RNA is that RNA that is complementary 
to the target mRNA is introduced into cells, resulting in specific RNA:RNA duplexes 
being formed by base pairing between the antisense substrate and the target mRNA 
(Green etal.,Annu. Rev. Biochem. 55:569-597 (1986)). Under one embodiment, the 
process involves the introduction and expression of an antisense gene sequence. Such a 
sequence is one in which part or all of the normal gene sequences are placed under a 
promoter in inverted orientation so that the 'wrong' or complementary strand is 
transcribed into a noncoding antisense RNA that hybridizes with the target mRNA and 
interferes with its expression (Takayama et al, Crit. Rev. Biochem. Mol. Biol. 25: 155-184 
(1990)). An antisense vector is constructed by standard procedures and introduced into 
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cells by transfonnation, transfection, electroporation, microinjection, infection, etc. The 
type of transformation and choice of vector will determine whether expression is transient 
or stable. The promoter used for the antisense gene may influence the level, timing, 
tissue, specificity, or inducibility of the antisense inhibition. 

It is understood that the activity of a protein in a plant cell may be reduced or 
depressed by growing a transformed plant cell containing a nucleic acid molecule whose 
non-transcribed strand encodes a protein or fragment thereof. 

Post transcriptional gene silencing (PTGS) can result in virus immunity or gene 
silencing in plants. PTGS is induced by dsRNA and is mediated by an RNA-dependent 
RNA polymerase, present in the cytoplasm, that requires a dsRNA template. The dsRNA 
is formed by hybridization of complementary transgene mRNAs or complementary 
regions of the same transcript. Duplex formation can be accomplished by using 
transcripts from one sense gene and one antisense gene co-located in the plant genome, a 
single transcript that has self-complementarity, or sense and antisense transcripts from 
genes brought together by crossing. The dsRNA-dependent RNA polymerase makes a 
complementary strand from the transgene mRNA and RNAse molecules attach to this 
complementary strand (cRNA). These cRNA-RNAse molecules hybridize to the 
endogene mRNA and cleave the single-stranded RNA adjacent to the hybrid. The 
cleaved single-stranded RNAs are further degraded by other host RNAses because one 
will lack a capped 5' end and the other will lack a poly(A) tail (Waterhouse et al, PNAS 
95: 13959-13964 (1998)). 

It is understood that one or more of the nucleic acids of the invention may be 
introduced into a plant cell and transcribed using an appropriate promoter with such 
transcription resulting in the postranscriptional gene silencing of an endogenous 
transcript. 

Antibodies have been expressed in plants (Hiatt et al, Nature 342:16-78 (1989); 
Conrad et al, Plant Mol. Biol. 26:1023-1030 (1994)). Cytoplasmic expression of a scFv 
(single-chain Fv antibodies) has been reported to delay infection by artichoke mottled 
crinkle virus. Transgenic plants that express antibodies directed against endogenous 



101 



WO 01/51627 



PCT7US01/00552 



proteins may exhibit a physiological effect (Philips et al, EMBO J. 76:4489-4496 (1997); 
Marion-Poll, Trends in Plant Science 2:447-448 (1997)). For example, expressed anti- 
abscissic antibodies have been reported to result in a general perturbation of seed 
development (Philips et al, EMBO J. 16: 4489-4496 (1997)). 

Antibodies that are catalytic may also be expressed in plants (abzymes). The 
principle behind abzymes is that since antibodies may be raised against many molecules, 
this recognition ability can be directed toward generating antibodies that bind transition 
states to force a chemical reaction forward (Persidas, Nature Biotechnology 75:1313-1315 
(1997); Baca et al, Ann. Rev. Biophys. Biomol. Struct. 26:461-493 (1997)). The catalytic 
abilities of abzymes may be enhanced by site directed mutagenesis. Examples of 
abzymes are, for example, set forth in U.S. Patent No: 5,658,753; U.S. Patent No. 
5,632,990; U.S. Patent No. 5,631,137; U.S. Patent 5,602,015; U.S. Patent No. 5,559,538; 
U.S. Patent No. 5,576,174; U.S. Patent No. 5,500,358; U.S. Patent 5,318,897; U.S. Patent 
No. 5,298,409; U.S. Patent No. 5,258,289 and U.S. Patent No. 5,194,585. 

It is understood that any of the antibodies of the invention may be expressed in 
plants and that such expression can result in a physiological effect. It is also understood 
that any of the expressed antibodies may be catalytic. 

(d) Antibodies 

One aspect of the present invention concerns antibodies, single-chain antigen 
binding molecules, or other proteins that specifically bind to one or more of the protein or 
peptide molecules of the present invention and their homologues, fusions or fragments. 
Such antibodies may be used to quantitatively or qualitatively detect the protein or 
peptide molecules of the present invention. As used herein, an antibody or peptide is said 
to "specifically bind" to a protein or peptide molecule of the present invention if such 
binding is not competitively inhibited by the presence of non-related molecules. 

Nucleic acid molecules that encode all or part of the protein of the present 
invention can be expressed, via recombinant means, to yield protein or peptides that can 
in turn be used to elicit antibodies that are capable of binding the expressed protein or 
peptide. Such antibodies may be used in immunoassays for that protein. Such protein- 
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encoding molecules, or their fragments may be a "fusion" molecule (i.e., a part of a larger 
nucleic acid molecule) such that, upon expression, a fusion protein is produced. It is 
understood that any of the nucleic acid molecules of the present invention may be 
expressed, via recombinant means, to yield proteins or peptides encoded by these nucleic 
acid molecules. 

The antibodies that specifically bind proteins and protein fragments of the present 
invention may be polyclonal or monoclonal and may comprise intact immunoglobulins, 
or antigen binding portions of immunoglobulins fragments (such as (F(ab'), F(ab')2), or 

single-chain immunoglobulins producible, for example, via recombinant means. It is 
understood that practitioners are familiar with the standard resource materials which 
describe specific conditions and procedures for the construction, manipulation and 
isolation of antibodies (see, for example, Harlow et al, In: Antibodies: A Laboratory 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, New York (1988)). 

Murine monoclonal antibodies are particularly preferred. BALB/c mice are 
preferred for this purpose, however, equivalent strains may also be used. The animals are 
preferably immunized with approximately 25 ug of purified protein (or fragment thereof) 
that has been emulsified in a suitable adjuvant (such as TiterMax adjuvant (Vaxcel, 
Norcross, GA)). Immunization is preferably conducted at two intramuscular sites, one 
intraperitoneal site and one subcutaneous site at the base of the tail. An additional i.v. 
injection of approximately 25 pg of antigen is preferably given in normal saline three 
weeks later. After approximately 1 1 days following the second injection, the mice may 
be bled and the blood screened for the presence of anti-protein or peptide antibodies. 
Preferably, a direct binding Enzyme-Linked Immunoassay (ELISA) is employed for this 
purpose. 

More preferably, the mouse having the highest antibody titer is given a third i.v. 
injection of approximately 25 ug of the same protein or fragment. The splenic leukocytes 
from this animal may be recovered 3 days later and then permitted to fuse, most 
preferably, using polyethylene glycol, with cells of a suitable myeloma cell line (such as, 
for example, the P3X63Ag8.653 myeloma cell line). Hybridoma cells are selected by 
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culturing the cells tinder "HAT" (hypoxanthine-arninopterin^hyrnine) selection for about 
one week. The resulting clones may then be screened for their capacity to produce 
monoclonal antibodies ("mAbs"), preferably by direct ELISA. 

In one embodiment, anti-protein or peptide monoclonal antibodies are isolated 
using a fusion of a protein or peptide of the present invention, or conjugate of a protein or 
peptide of the present invention, as immunogens. Thus, for example, a group of mice can 
be immunized using a fusion protein emulsified in Freund's complete adjuvant (e.g., 
approximately 50 ug of antigen per immunization). At three week intervals, an identical 
amount of antigen is emulsified in Freund's incomplete adjuvant and used to immunize 
the animals. Ten days following the third immunization, serum samples are taken and 
evaluated for the presence of antibody. If antibody titers are too low, a fourth booster can 
be employed. Polysera capable of binding the protein or peptide can also be obtained 
using this method. 

In a preferred procedure for obtaining monoclonal antibodies, the spleens of the 
above-described immunized mice are removed, disrupted and immune splenocytes are 
isolated over a ficoll gradient. The isolated splenocytes are fused, using polyethylene 
glycol with B ALB/c-derived HGPRT (hypoxanthine guanine phosphoribosyl transferase) 
deficient P3x63xAg8.653 plasmacytoma cells. The fused cells are plated into 96 well 
microliter plates and screened for hybridoma fusion cells by their capacity to grow in 
culture medium supplemented with hypothanthine, aminopterin and thymidine for 
approximately 2-3 weeks. 

Hybridoma cells that arise from such incubation are preferably screened for then- 
capacity to produce an immunoglobulin that binds to a protein of interest. An indirect 
ELISA may be used for this purpose. In brief, the supernatants of hybridomas are 
incubated in microliter wells that contain immobilized protein. After washing, the titer of 
bound immunoglobulin can be determined using, for example, a goat anti-mouse antibody 
conjugated to horseradish peroxidase. After additional washing, the amount of 
immobilized enzyme is determined (for example through the use of a chromogenic 
substrate). Such screening is performed as quickly as possible after the identification of 
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the hybridoma in order to ensure that a desired clone is not overgrown by non-secreting 
neighbor cells. Desirably, the fusion plates are screened several times since the rates of 
hybridoma growth vary. In a preferred sub-embodiment, a different antigenic form may 
be used to screen the hybridoma. Thus, for example, the splenocytes may be immunized 
with one immunogen, but the resulting hybridomas can be screened using a different 
immunogen. It is understood that any of the protein or peptide molecules of the present 
invention may be used to raise antibodies. 

Such antibody molecules or their fragments may be used for diagnostic purposes. 
Where the antibodies are intended for diagnostic purposes, it may be desirable to 
derivatize them, for example with a ligand group (such as biotin) or a detectable marker 
group (such as a fluorescent group, a radioisotope or an enzyme). 

The ability to produce antibodies that bind the protein or peptide molecules of the 
present invention permits the identification of mimetic compounds of those molecules. A 
"mimetic compound" is a compound that is not that compound, or a fragment of that 
compound, but which nonetheless exhibits an ability to specifically bind to antibodies 
directed against that compound. 

Having now generally described the invention, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

EXAMPLE 1 

In this example, DNA is extracted from soybean plants, amplified, and mapped. 

A single trifoliate leaf is collected from the newest growth of four week old 
soybean plants. Leaf tissue from the leaf is placed on ice and stored at -80°C. The frozen 
tissue is lyophilized, and approximately 0.01 grams of the tissue is used for DNA 
extraction. The 0.01 grams of leaf tissue is ground to powder in 1.4 ml tubes. 600 
microliters (pi) of DNA extraction buffer consisting of 0.5M NaCl, 0.1M Tris- 
(hydroxymethyl) aminomethane pH 8.0, 0.05 M ethylenediaminetetra-acetic acid 
(EDTA), 10.0 g L" 1 sodium dodecyl sulfate (SDS), and 2 g L" 1 phenantroline (dissolved in 
0.01 L ethanol) is heated to 65°C (with 0.77 g L" 1 dithiothreitol added immediately before 
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use) is added to each tube, and each tube is mixed thoroughly. The samples are placed in 
a 65°C water bath for 15 minutes and shaken by hand after 10 minutes. The samples are 
taken out of the water bath and cooled to room temperature, and then 200 ul of 5 M 
KOAc is added to each tube. The samples are inverted and placed at 4°C for 20 minutes. 
Samples are then centrifuged for 12 minutes at 6200 X g and the supernatant (about 600 
ul) is transferred to new tubes. DNA is precipitated with 330 ul of cold isopropanol and 
placed at -20°C for 1 hr. The DNA is pelleted by centrifuging at 6200 X g for 10 minutes 
and washed with 70% EtOH. The DNA is pelleted by centrifugation at 6200 X g for 10 
minutes and dried using a Speed- Vac. The DNA is dissolved in 100 ul of TE 0 .i (0.01 M 
Tris-HCl pH 8.0, 0.0001 M EDTA). The extraction will generally yield 500 ng DNA ul" 1 . 

A polymerase chain reaction (PCR) is conducted with 5 to 10 ng genomic DNA 
in 10 ul volumes of 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 0.001% gelatin, 1.5 mM 
MgCl 2 , 0. 1 mM of each dNTP, 150 nM of each primer, 0.01 mM Cresol Red, 2% sucrose 
and 0.32 units of AmpliTaq DNA Polymerase (Perkin Elmer Instruments Inc., USA). For 
thermocycling, the Gene Amp PCR System 9700 (Perkin Elmer Instruments Inc., USA) is 
used with one step of 94°C for 3 minutes, then 32 cycles of 94°C, 47°C, and 72°C steps 
of 25 sec each and one final step of 72°C for 3 minutes. The PCR products are run on a 
6% polyacrylamide gel (30 cm X 8 cm X 1 mm) in IX TAE (40 mM Tris-HCl, pH 8.3, 1 
mM EDTA) at 180 v for 45 minutes. The gels are stained using SYBR Gold (Molecular 
Probes, Eugene, OR) according to the manufacturer's instructions. 

SSR primer screening for polymorphism is performed using PIC, HS-1, Will and 
PI507354 genotypes. SSRs that are polymorphic and easy to score (i.e., clear banding 
pattern and good separation between alleles) are mapped using the HS-1 x PIC (F2) 
and/or Will x PI507354 (RTL) mapping populations. At least one SSR per BAC sequence 
is mapped. DNA markers that exhibited codominant banding patterns are scored as 
homozygous for one or the other parent or as heterozygous, exhibiting both parental 
alleles. Marker scores are checked for segregation distortion using the chi-squared test 
for goodness of fit to expected ratios. Linkage relationships are determined using 
Mapmaker Version 3.0b with a LOD of 3.0 (Whitehead Institute, Cambridge, MA). 
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EXAMPLE 2 

DNA fragments containing candidates for genes rhgl and Rhg4 from susceptible 
and resistant soybean lines are subcloned into a TA cloning plasmid (TOPO TA Cloning 
Kit, Version E, Invitrogen Corporation, 1600 Faraday Avenue, Carlsbad, CA). 

Genomic DNA from 24 susceptible and 9 resistant lines is isolated using standard 
techniques. Approximately 500 nanograms (ng) of DNA is used for PCR amplification. 
Resistant BAC DNA is isolated by using AUTOGEN (AutoGen Corp., 35 Loring Drive 
Framingham, MA). PCR amplification is then performed using 0.1- 0.2 ng of resistant 
BAC DNA. The primers that are used to amplify candidate rhgl genes PCR are as 
follows: 

Fragment I (2,892 bp) primer (SEQ ID NO: 25), GCA ATA CTT GAA GGA 
ATA TGT CCA C; primer (SEQ ID NO: 24), beginning at start codon, ATG GAT GGT 
AAA AAT TCA AAA CTA AAC; modified reverse primer 1 (SEQ ID NO: 1123), 
beginning 5 bp before start codon; GTT GTA TGG ATG GTA AAA ATT CAA AAC. 
Fragment II (1,746 bp) reverse primer 2 (SEQ ID NO: 27), ending at 13 bp after stop 
codon, GAC TGG CTG TGA CTG ATC TCT CT; primer 2 (SEQ ID NO: 26), CTC ACT 
TAC ACT GCT GAA TGC AGA. 

The primers for Rgh4 PCR are as follows: 

Forward primer (SEQ ID NO: 48), ATG TCT CTC CCC AAA ACC CTA CTT 
TCT CTC; reverse primer (SEQ ID NO: 49), ending at 2 bp after stop codon, GGT TAA 
CGG CAA TCC ATT GAA TCA AAG GAG. 

PCR amplification is performed in an MJ Research PTC DNA Engine TM 
System, Model PTC-225 (MJ Research Jnc, 590 Lincoln Street Waltham, MA). PCR is 
performed using the following components: lul DNA, 5ul lOx buffer, lul primer 1, lul 
primer 2, lul lOmM dNTP, 1.5 ul 50mM MgCl 2 , 0.2ul Taq. (Platinum), 39.3ul H z O. The 
PCR program used is as follows: 95°C for 10 minutes (step 1), 95°C for 30 seconds (step 
2), 70°C for 30 seconds/-l°C per cycle/72°C for 3 minutes (step 3), repeat steps two 
through three 9 times (step 4), 95°C for 30 seconds (step 5), 60°C for 30 seconds (step 6), 
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72°C for 3 minutes (step 7), repeat steps five through seven 34 times (step 8), 4°C forever 
(step 9), end. 

PCR products are separated on 1% agarose gel by electrophoresis. A single DNA 
band is excised from gel. Gel extraction is done using CLONTECH NucleoSpin 
Extraction Kit (Clonetech Laboratories Inc., 1020 East Meadow Circle, Palo Alto, CA). 2 
ul of purified DNA is loaded on 1% agarose gel to check concentration. 40-100ng of 
DNA is used for subcloning. 

A TOPO cloning reaction is done according to the following: 4pl of fresh PCR 
product, lpl Clontech Salt Solution, and 1 pi TOPO vector. The solution is mixed gently, 
incubated for 10 minutes at room temperature, and then placed on ice. 

A one shot chemical transformation is performed as follows. 2pl of the TOPO 
Cloning reaction is added to a vial of TOP 10 One Shot Chemically Competent E. coli 
and mixed gently. The mixture is then Incubated on ice for 30 minutes. The cells are 
then heat-shocked for 30 seconds at 42°C, and immediately transferred to ice. 250 pi of 
SOC medium is then added, and the mixture is incubated at 37°C for 1 hour. 80 pi is then 
spread onto a selective plate, and 170 pi is spread onto another plate. The plates are 
incubated at 37°C for 18-20 hours. The selective plates are LB agar plates with 100 
pg/ml ampicillin, 40 pg/ml IPTG, and 40 pg/ml X-GAL. 

After incubation, 8-10 white or light blue colonies are selected. The positive 
colonies are inoculated into LB medium containing 50 pg/ml ampicillin and incubated at 
37°C overnight. Sterilized glycerol is added to make 15% glycerol stock, which can be 
stored at -80°C. 

Sanger sequencing reactions are performed on subclones using BigDye 
Terminators (Applied Biosystems, 850 Lincoln Centre Drive, Foster City, CA) and then 
analyzed on ABI 377/ABI 3700 automated sequencing machines (Applied Biosystems, 
850 Lincoln Centre Drive, Foster City, CA). The sequences are evaluated for quality and 
error probability using the program, PHRED (Ewing and Green, Genome Res., 8:186-194 
(1998), Ewing et al, Genome Res., 8:175-185, (1998)), assembled using the phrap 
assembler and viewed using consed (Gordon et al., Genome Res., 8: 195-202). An rhgl 
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candidate gene is found in BAC 240017, and is about 4.5 kb in size. An RhgA candidate 
was found in BAC 318013, and is about 3.5 kb in size. 

EXAMPLE 3 

The physical mapping of a QTL (quantitative trait locus) is described in this 
example. Mapping is initiated with linkage analysis of SSR (simple sequence repeats) 
markers. Markers that are shown to be linked to the QTL of interest are used to PCR 
screen the soy BAC library and identify candidate BACs. Confirmed BACs are 
subcloned and sequenced, BAC-end sequenced, and fingerprinted. New markers are 
designed from good BAC-end sequences and used to screen the library, by either PCR or 
hybridization to high density grid filters, in order to extend the contigs. A BAC-end 
sequence and fingerprint database of soy BACs is used in conjunction with the above 
methods to help build and extend contigs. Sequenced BACs are aligned, and overlapping 
BACs are placed into contigs. These contigs, which contain unique sequences, are put 
into an ACEDB database, and predicted genes are annotated by hand using various 
programs. Candidates genes (for the gene of interest) are subcloned from genomic DNA 
of different lines by PCR using primers from outside the predicted coding regions. These 
subclones are sequenced and screened for SNPs (single nucleotide polymorphisms) and 
INDELs (insertions/deletions), and different haplotypes of the lines with and without the 
desired phenotype are examined for correlations between the haplotype and phenotype. 

A single trifoliate leaf is collected from the newest growth of four week old 
soybean plants. The leaf tissue is placed on ice and stored at -80°C. The frozen tissue is 
lyophilized and approximately 0.01 grams of tissue is used for DNA extraction. The leaf 
tissue is ground to powder in 1.4 ml tubes and 600 ul of DNA extraction buffer [0.5M 
NaCl, 0.1M Tris-(hydroxymethyl) aminomethane pH 8.0, 0.05 M ethylenediaminetetra- 
acetic acid (EDTA), 10.0 g L" 1 sodium dodecyl sulfate (SDS), 2 g L" 1 phenantroline 
(dissolved in 0.01 L ethanol)] heated to 65°C (with 0.77 g L" 1 dithiothreitol added 
immediately before use) is added to each tube and mixed thoroughly. The samples are 
placed in a 65°C water bath for 15 minutes and shaken by hand after 10 min. The 
samples are taken out of the water bath, cooled to room temperature, and 200 ul of 5 M 
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KOAc is added to each tube. The samples are inverted and placed at 4°C for 20 min. 
Samples are then centrifuged for 12 minutes at 6200 X g and the supernatant (about 600 
ul) is transferred to new tubes. DNA is precipitated with 330 ul of cold isopropanol and 
placed at -20°C for 1 hr. The DNA is pelleted by centrifuging at 6200 X g for 10 minutes 
and is washed with 70% EtOH. The DNA is pelleted by centrifugation at 6200 X g for 10 
minutes and dried using a Speed-Vac. The DNA is dissolved in 100 ul of TE 0 .i (0.01 M 
Tris-HCl pH 8.0, 0.0001 M EDTA). The extraction yields 500 ng DNA (if 1 . 

The polymerase chain reaction (PCR) is conducted with 5 to 10 ng genomic DNA 
in 10 ul volumes of 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 0.001% gelatin, 1.5 mM 
MgCl 2 , 0. 1 mM of each dNTP, 150 nM of each primer, 0.01 mM Cresol Red, 2% sucrose 
and 0.32 units of AmpliTaq DNA Polymerase (Perkin Elmer Instruments Inc., USA, 761 
Main Avenue, Norwalk, CT). For thermocycling, the Gene Amp PCR System 9700 
(Perkin Elmer Instruments Inc., USA, 761 Main Avenue, Norwalk, CT) is used with one 
step of 94°C for 3 min, then 32 cycles of 94°C, 47°C and 72°C steps of 25 sec each and 
one final step of 72°C for 3 min. The PCR products are run on a 6% polyacrylamide gel 
(30 cm X 8 cm X 1 mm) in IX TAE (40 mM Tris-HCl, pH 8.3, 1 mM EDTA) at 180v for 
45 min. The gels are stained using SYBR Gold (Molecular Probes, Eugene, OR) per 
manufacturers instructions. 

SSR primer screening for polymorphisms is performed using PIC, HS-1, Will and 
PI507354 genotypes. SSRs that are polymorphic and easy to score (i.e., Clear banding 
pattern and good separation between alleles) are mapped using the HS-1 x PIC (F2) 
and/or Will x PI507354 (RIL) mapping populations. At least one SSR per BAC sequence 
is mapped. DNA markers that exhibited codominant banding patterns are scored as 
homozygous for one or the other parent or as heterozygous, exhibiting both parental 
alleles. Marker scores are checked for segregation distortion using the chi-squared test 
for goodness of fit to expected ratios. Linkage relationships were determined using 
Mapmaker Version 3.0b with a LOD of 3.0 (Whitehead Institute for Biomedical 
Research, Cambridge MA). 
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Thirty-two BAC DNA superpools (10 genomic equivalents) extracted from either 
4608 clones (48 96-well microtiter plates) are used as templates for the first round of PCR 
screening. Following identification of the positive superpools, the second screening is 
performed against 4-D BAC DNA pools. Each clone of the superpool is addressed 4- 
dimentionally (7 X 7 X 12 X 8) and pooled in each dimension. Each set of 48 plates is 
divided into 6 sets of 7 plates and one set of 6 plates, and partitioned in two ways. The 
first partition is in numerical order, plates 1-7, 8-14, . . . 43-48 representing 7 group or 
stack pools. The second partition is according to plate position within each of the 
respective stacks, plates [1, 8, 15, 22, 29, 36], [2,9,16, 23, 30, 37, 43] etc., representing 7 
plate pools. Each well of the 96- well plates contains 12 columns and 8 rows. Clones 
from row 1 are pooled from all 48 plates to generate the row 1 pool. Clones of rows 2, 3, 
4. . ..8, and columns 1, 2, 3. . ..12 are pooled to generate 8 row pools and 12 column pools 
respectively. 

For each superpool, BAC DNA is extracted from a total of 34 subpools (7 + 7 + 8 
+ 12). Positive clones are identified by TaqMan/PCR screening of the 34 subpools if one 
positive clone is present. If more than one positive clone is present in a supeipool, a third 
round of screening with N4 PCR reactions is performed. 

Addresses of candidate BACs are identified, and the candidates are streaked out 
for single colony isolation and grown overnight at 37°C. A single, isolated colony is 
picked and streaked out and grown overnight at 37°C. PCR is repeated for the marker of 
interest (using the program designed for the relevant marker) using a smear of cells from 
the plate streaked from a single colony. The PCR product is run on a 2% agarose gel and 
purified using the Clonetech NucIeoSpin Gel Extraction Kit (according to the 
manufacturer's instructions, Clonetech Laboratories Inc., 1020 East Meadow Circle, Palo 
Alto, CA) and 10-50 ng of the purified DNA are added to 10 pmol of each primer 
(forward and reverse), in a total volume of 6 pi of ddH20 and 2 pi of BigDye 
Terminators (Applied Biosystems, 850 Lincoln Centre Drive, Foster City, CA). The 
cycling conditions are: 96°C for 1 minute (step 1), 96°C for 10 seconds (step 2), 50°C for 
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5 seconds (step 3), 60°C for 4 minutes (step 4), steps 2-4 are repeated for 24 cycles (step 
5), and hold at 4°C. 

The generated sequence is compared to the consensus sequence using DNA 
comparison software. Confirmed clones are subcloned, sequenced, BAC-end sequenced, 
and Fingerprinted. 

BAC-end sequencing is done using 3.2 pmol of SP6 and T7 primers (separately), 
approximately 600 ng-1 ug of BAC DNA (Autogen prepped, AutoGen Corp., 35 Loring 
Drive Framingham, MA) reaction, resuspended in 6ul of ddH20, and 4ul of BigDye 
Terminators (Applied Biosystems 850 Lincoln Centre Drive, Foster City, CA) to give a 
total reaction volume of lOul. The cycling conditions are: 96°C for 2 minutes (step 1), 
96°C for 15 seconds (step 2), 50°C for 15 seconds (step 3), 60°C for 4 minutes (step 4), 
steps 2-4 are repeated for 50-60 cycles (step 5), 72°C for 2 minutes (step 6), hold at 4°C 
or 10°C (step 7). 

The reactions are ethanol precipitated and loaded on capillary sequencers. The 
newly generated BAC-end sequence is trimmed from the vector sequence, and entered 
into a database containing approximately 400,000 BAC-end sequences. Each BAC is 
blasted against the database to search for BAC-end matches extension of the contigs. 
New markers are designed from good BAC-end sequences, and these are then used to 
rescreen the library in order to build up contigs across the region of interest. Screening 
can be done in either of two ways: as above (PCR strategy), or by hybridization of high- 
density grid filters from Research Genetics (Research Genetics, 2130 Memorial Parkway, 
Huntsville, AL). 

The probes used for hybridization are derived from clones or genomic DNA by 
PCR amplification using the vector or gene-specific primers, with the appropriate cycling 
conditions. PCR products are run on a 1% agarose gel containing ethidium bromide (0.2 
ug/ml) in IX TAE buffer at 100 volt for 1-2 hrs. Isolated DNA fragments are excised and 
gel-purified using the Clonetech NucleoSpin gel extraction kit (Clonetech Laboratories 
Inc., 1020 East Meadow Circle, Palo Alto, CA), before labeling. In order to check the 
size of the fragments and concentration, 2 ul of eluted DNA plus loading buffer are 
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loaded on a 1% agarose gel along with DNA markers of known concentration and size. 
All the probes used to screen the library are tested individually for repetitiveness, with a 
smaller filter spotted with random clones from the library along with some positive 
control clones according to the protocol described below. 

The A3244 soy library generated by a an EcoRl digest is spotted on 3 high 
density grid filters from Research Genetics (Research Genetics, 2130 Memorial Parkway, 
Huntsville, AL). Each filter has six fields, twelve 384 well plates are spotted in each field 
in duplicate, with a total of 27,648 clones spotted on each filter. The plates are spotted in 
a 5X5 grid (12 clones per 5X5 grid) pattern within each field. Each clone is spotted in 
duplicate with a specific orientation within the 5X5 grid, which, together with the field 
position, gives information about its address. In a first round hybridization procedure, 
multiple probes are labeled separately and then pooled together to hybridize to BAC 
filters. Positive BACs identified in this procedure are deconvoluted by rehybridization 
with the individual probes. 

A hybridization oven is set at 65°C, and Church Buffer (0.5 M Sodium 
Phosphate, pH 7.0, 7% SDS, 1% bovine serum albumin, 1 mM EDTA, 100 ng/ml salmon 
sperm DNA) is prewarmed to 65°C. Membranes are washed in 500 ml of 0.1X SSC, 
0.1% SDS in a large container at room temperature for 5 minutes with gentle shaking (50 
rpm) on a rotary shaker. The membranes are rinsed with 500 ml of 0.1X SSC (no SDS) 
for 1 minute. The wash solution is poured off, and 500 ml of 6XSSC (no SDS) is added 
to equilibrate the membranes. Three filters are placed in a tube. The filters are separated 
from each other and the sides of the tube by a layer of mesh. Each tube is filled with 
6XSSC and shaken gently with the tube vertical to help eliminate bubbles between the 
filters and tube wall. The 6X SSC solution is poured off, and 25 ml of pre-warmed 
Church buffer is added. The bottles are rotated in a hybridization oven at 60 rpm and 
65°C for 30 minutes or longer. 

Probes are labeled using 1 ul of 40-50 uCi/ul [a 32 P dCTP], 50 ng of purified 
DNA in 49 ul of ddH20, and Read-To-Go Labeling Beads from Amersham Pharmacia 
according to the manufacturers instructions (Amersham Pharmacia, Uppsala, Sweden). 
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The probes are purified using the Bio-Spin Column P30 from BioRad according to 
manufacturers instructions (Bio-Rad Laboratories, 3316 Spring Garden Street, 
Philadelphia, PA). To 1 pi of the column-purified probe is added to a minipoly-Q vial 
(liquid scintillation vial) for each probe. 5 ml of scintillation liquid is added to each vial, 
and radiation activity for each vial is measured using a liquid scintillation counter. 

After the probes are purified and counted for radioactivity, 10-20 probes and one 
control probe (from 50 pi reaction) are pooled with 10 7 cpm/probe each, into one 1.5 ml 
eppendorf tube. The pooled probes are denatured at 99°C in a sand heating block for 10 
minutes. The tubes are cooled on ice or ice water about 2 minutes, and then spun down at 
14,000 rpm for 30 seconds in microcentrifuge. The tubes are pre-hybridized in 25ml of 
Church buffer for at least 30 minutes, which is then poured off. 40 ml of fresh 
hybridization solution (pre-warmed Church buffer) is added. The pooled-probe solution 
is added to the hybridization tube. The tube is rotated in the hybridization oven at 60 
rpm, 65 °C overnight. 

The probe solution is poured off, 30 ml of pre-warmed (65°C) IX SSC, 0. 1% 
SDS washing solution is added to the hybridization tube, the hybridization tube is rotated 
in the hybridization oven (at 65° C) for 15 minutes, and the process is repeat two times. 
At the last wash, the tube is rotated 180° and at the same speed for 15 minutes at 65°C. 
The washing solution is poured off, and 2X SSC (no SDS) is added. 

Excess liquid is removed from each filter by placing the filter on a piece of 3MM 
paper. The washed filter is placed on developed film with the DNA-side up (the side 
BACs were spotted on), covered with Saran wrap, and squeezed to force out liquid and 
bubbles. The Saran wrap is folded to the other side of the film, fixed it with tape, and 
then dried Kimwipes. The wrapped filters are placed into a film cassette with the DNA- 
side up (the side BACs were spotted on), which is placed on BioMax MS film (Biomax 
Technologies hie, Vancouver, BC, Canada) in a darkroom, and exposed overnight at 
room temperature without an intensifying screen. Film is developed with a film 
developer in the dark room the next day, and each film is labeled with filter number, 
probe used for hybridization, exposure time, and date. 
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Starting from Field 3, a 384-well grid is put on the field with the Al position of 
the grid on the upper right, and the grid is aligned to the image. The row and column 
position for each positive clone on the BAC recording spreadsheet is determined and 
recorded. The pattern of the hybridization signal is matched to known patterns. There are 
6 plate reference numbers for each of twelve patterns, which are arranged in the same 
manner as the 6 fields. Based on the signal pattern and field number, a plate reference 
number is determined for each positive clone. The grid is moved to the next field and the 
process is repeated. The original plate number (P) is determined using the following 
formula: P = (N-l) x 72 + R, where N is the filter number on which the identified clone is 
present and R is the plate reference number previously determined. The complete address 
of the identified clone is given by the original plate number plus its position on the plate 
determined previously. BACs' addresses are identified and converted to "imp" files 
according to a Q-bot file format. 

24 working plates are loaded into a Q-bot (Genetix, Queensway, New Milton, 
Hampshire, United Kingdom) 6-high hotel and media-filled 96-well plates are placed on 
the deck. The Q-bot is run following the standard manual using the program called 
"Rearraying98" with the settings given in Appendix III of the accompanying manual: 
BAC-Picking. Plates containing picked clones are placed in a shaker incubator and 
grown overnight at 37° C at 200 rpm. 

35 pi DNA solution are transferred from 96-well plates into a 384-well plate 
using a Platemate such that 4 96-well plates of DNA are combined into one 384-well 
plate. The 384-pin head (puck) is washed in 10% SDS solution for 5 minutes, 
ultrasonicated in a water bath for 3 minutes, washed with 70% ethanol for 1 min., and air 
dried for 3 minutes. The 384-well DNA source plates and membranes are arranged on the 
deck according to the instruction from the manual and the spotted grid design chosen for 
the membrane. Spotting pattern are designed so that there is one control probe at each of 
the 4 corners of the membrane. An asymmetric pattern is used to orient filters. The 
control probe concentration is about 5 ng/ul. Zeus is run according to instructions. If the 
DNA concentration is lower than 5 ng/ul, the Zeus is run a second time to double the 
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amount of spotted DNA on the membrane. One of the empty spots is spot dyed, if 
available, using one 384-well dye plate. If an empty spot is not available, it is printed on 
one of the DNA spots. This spot marks the position for cutting filters into small 
membranes (9X12 cm). Membranes are interleaved between 3M papers and left to air- 
dry. Each corner of each membrane is marked with a permanent marker and numbered. 
Filters are denatured on the surface of 3M paper soaked with denaturalization solution for 
4 minutes, and then neutralized on the surface of 3 M paper soaked with neutralization 
solution for 5 minutes. The filters are washed with 2XSSC for 5 minutes and then air 
dried. The filters are then baked at 80°C for 1 hr. and cut into individual small 
membranes (9X12 cm) according to the marked corner. 

To confirm and deconvolute, hybridizations are done as before, but with newly 
generated filters, and each probe is done separately with a single filter using the smaller 
tube. 15 ml of Church buffer is used for the hybridization. 

Fingerprints are generated by digesting the BAC DNA with Hind III for 3 hours 
at 37°C and running the reaction on a 0.8% gel at 200V for 19 hours. The gels are stained 
with SybrGreen, while shaking at room temperature for 45 minutes, and scanned with a 
Flourlmager. The bands are sized using Frag software and the fingerprints are assembled 
into contigs within FPC. Every time new clones are added the contigs are rebuilt using a 
tolerance of 10 and a cutoff of 10' 9 . 

Subclones are generated and Sanger sequencing reactions were performed on 
randomly chosen subclones using BigDye Terminators (Applied Biosystems, 850 Lincoln 
Centre Drive, Foster City, CA) then analyzed on ABI 377/ABI 3700 automated 
sequencing machines (Applied Biosystems, 850 Lincoln Centre Drive, Foster City, CA). 
7-8 fold sequence coverage is thereby generated across the BAC. The sequences are 
evaluated for quality and error probability using the program, phred, assembled using the 
phrap assembler, and viewed using consed, as in example 2. For Bermuda standard 
BACs, all contigs are ordered and oriented and all gaps are closed using a directed primer 
walking strategy. A final quality value of phred40 (1 base error in 10,000 bases) with no 
gap regions, double coverage or two chemistries across single stranded areas is achieved. 
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The sequence contigs are put into an ACEDB database along with soy EST and 
plant EST matches, along with Blastx, Tblastx, and Plant Blastn hits. GenemarLhrnm is 
used to predict possible genes, and GeneFinder is used to predict splicing sites, ORFs, 
potential coding regions, as well as start and stop codons. The contigs are then annotated 
by hand and predicted genes accepted, edited, and modified based on the characteristics 
present in the sequence and matches to protein, nucleotide, and EST databases. 

The high-density BAC library membranes used for hybridization are made by 
Research Genetics (Research Genetics, 2130 Memorial Parkway, Huntsville, AL), using a 
modified Q-bot (Genetix, Queens way, New Milton, Hampshire, United Kingdom), 384- 
well plates containing BACs are spotted onto 22 cm X 22 cm Hybond N+ membranes 
(Amersham Pharmacia, Uppsala, Sweden). Bacteria from 72 plates are spotted twice onto 
one membrane, giving 55,296 colonies in total, or 27,648 unique clones per membrane. 
The plates are spotted into six "fields" per membrane, with each field having 12 plates 
spotted in duplicate. This spotting format results in six fields with 384 grids in each field. 
Each grid is a 5X5 matrix containing 12 unique clones in duplicate, with the center 
position left empty. The two positions occupied by each clone in duplicate are designed 
to give a unique pattern that indicates the plate location of each clone. After spotting, the 
bacteria on the membrane are incubated for 8 hours on LB-agar plates containing 12.5 
ug/ml chloramphenicol. The membranes are then denatured, neutralized, washed in a 
standard procedure, UV-light crosslinked, and air-dried. The membranes can be stored 
and shipped at room temperature. 
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What is claimed is: 

1. A method for the production of a soybean plant having an rhgl SCN resistant allele 
comprising: 

(A) crossing a first soybean plant having an rhgl SCN resistant allele with a 
second soybean plant having an rhgl SCN sensitive allele to produce a segregating 
population; 

(B) screening said segregatir" population for a member having an rhgl SCN 
resistant allele with a first nucleic acid molecule capable of specifically hybridizing to 
linkage group G, wherein said first nucleic acid molecule specifically hybridizes to a 
second nucleic acid molecule that is linked to said rhgl SCN resistant allele; and, 

(C) selecting said member for further crossing and selection. 

2. The method for the production of a soybean plant according to claim 1, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to said second 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 2, 3, 
complements thereof, or fragments thereof having at least 15 nucleotides. 

3. The method for the production of a soybean plant according to claim 1, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to said second 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 5 or 6, 
complements thereof, or fragments thereof having at least 15 nucleotides. 

4. The method for the production of a soybean plant according to claim 1, wherein 
said first nucleic acid molecule is a nucleic acid marker capable of detecting rhgl 
haplotype 2 or 4. 

5 . The method for the production of a soybean plant according to claim 1 , wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group G within lOOkb of said rhgl 
SCN sensitive allele. 

6. The method for the production of a soybean plant according to claim 5, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group G and located within 50kb of 
said rhgl SCN sensitive allele. 
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7. The method for the production of a soybean plant according to claim 6, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group G and located within 25kb of 
said rhgl SCN sensitive allele. 

8. The method for the production of a soybean plant according to claim 7, wherein 
said rhgl SCN sensitive allele is also present in soybean line A3244. 

9. The method for the production of a soybean plant according to claim 1, wherein 
said first soybean plant having an rhgl SCN resistant allele is selected from the soybean 
maturity group consisting of 000, 00, 0, 1, II, HI, IV, and V. 

10. The method for the production of a soybean plant according to claim 1, wherein 
said first soybean plant having an rhgl SCN resistant allele is selected from the soybean 
maturity group consisting of VI, VII, VITI, IX, and X. 

11. A method of investigating an rhgl haplotype of a soybean plant comprising: 

(A) isolating nucleic acid molecules from said soybean plant; 

(B) determining the nucleic acid sequence of an rhgl allele or part thereof; and, 

(C) comparing the nucleic acid sequence of said rhgl allele or part thereof to a 
reference nucleic acid sequence. 

12. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 11, wherein said determining of said nucleic acid sequence of said rhgl allele or 
part thereof is a determination of a single nucleotide. 

13. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 11, wherein said determining of said nucleic acid sequence of said rhgl allele or 
part thereof is a determination of the nucleic acid sequence of an exon of rhgl. 

14. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 13, wherein said exon is exon 1 of rhgl v.l. 

15. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 13, wherein said exon is exon 3 of rhgl v.l. 

16 The method of investigating an rhgl haplotype of a soybean plant according to 
claim 13, wherein said exon is exon 1 of rhgl v.2. 
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17. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 13, wherein said exon is exon 2 of rhgl v. 2. 

18. The method of investigating an rhgl haplotype of a soybean plant according to 
claim 13, wherein said determination of said nucleic acid sequence of said rhgl allele or 
part thereof is a determination of the nucleic acid sequence of a leucine rich repeat 
domain. 

19. A method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant comprising: 

performing marker assisted selection of said soybean plant with a nucleic acid 
marker, wherein said nucleic acid marker specifically hybridizes with a nucleic acid 
molecule having a first nucleic acid sequence that is physically linked to a second nucleic 
acid sequence that is located on linkage group G of soybean A3244, wherein said second 
nucleic acid sequence is within 500 kb of a third nucleic acid sequence which is capable 
of specifically hybridizing with the nucleic acid sequence of SEQ ID NO: 5, 6, 
complements thereof, or fragments thereof having at least 15 nucleotides; and, 

selecting said soybean plant based on said marker assisted selection. 

20. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with a sensitive soybean recurrent parent. 

21. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with an RhgA SCN resistance soybean 
recurrent parent. 

22. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with an elite soybean recurrent parent. 

23 . The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said nucleic acid marker is selected from 
the group consisting of SEQ ID NOs: 54-1096 and complements thereof. 
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24. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said nucleic acid marker is capable of 
detecting a single nucleotide polymorphism selected from the group of those set forth in 
table 2. 

25. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said nucleic acid marker is capable of 
detecting a SSR selected from the group consisting of SEQ ID NOs: 54-1096 and 
complements thereof. 

26. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 19, wherein said physically linked second nucleic acid 
sequence is within 100 kb of a nucleic sequence which is capable of specifically 
hybridizing with the nucleic acid sequence of SEQ ED NO: 5, or 6, complements thereof, 
or fragments thereof having at least 15 nucleotides. 

27. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 26, wherein said physically linked second nucleic acid 
sequence is within 50 kb of a nucleic sequence which is capable of specifically 
hybridizing with the nucleic acid sequence of SEQ ID NO: 5 or 6, complements thereof, 
or fragments thereof having at least 15 nucleotides. 

28. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 27, wherein said physically linked second nucleic acid 
sequence is within 25 kb of a nucleic sequence which is capable of specifically 
hybridizing with the nucleic acid sequence of SEQ ID NO: 5 or 6, complements thereof, 
or fragments thereof having at least 15 nucleotides. 

29. A method for the production of a soybean plant having an RhgA SCN resistant 
allele comprising: 

(A) crossing a fust soybean plant having an RhgA SCN resistant allele with a 
second soybean plant having an RhgA SCN sensitive allele to produce a segregating 
population; 

(B) screening the segregating population for a member having an RhgA SCN 
resistant allele with a first nucleic acid molecule capable of specifically hybridizing to 
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linkage group A2, wherein said first nucleic acid molecule specifically hybridizes to a 
second nucleic acid molecule linked to said RhgA SCN resistant allele; and, 
(C) selecting said member for further crossing and selection. 

30. The method for the production of a soybean plant according to claim 29, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to said second 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 4, complements 
thereof, or fragments thereof having at least 15 nucleotides. 

3 1 . The method for the production of a soybean plant according to claim 29, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to said second 
nucleic acid molecule having the nucleic acid sequence of SEQ ID NO: 7, complements 
thereof, or fragments thereof having at least 15 nucleotides. 

32. The method for the production of a soybean plant according to claim 29, wherein 
said nucleic acid molecule is a nucleic acid marker capable of detecting RhgA haplotype 
3. 

33. The method for the production of a soybean plant according to claim 29, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group A2 within lOOkb of said 
RhgA SCN sensitive allele. 

34. The method for the production of a soybean plant according to claim 33, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group A2 and located within 50kb 
of said RhgA SCN sensitive allele. 

35. The method for the production of a soybean plant according to claim 34, wherein 
said first nucleic acid molecule is capable of specifically hybridizing to a nucleic acid 
molecule having a sequence that is present on linkage group A2 and located within 25kb 
of said RhgA SCN sensitive allele. 

36. The method for the production of a soybean plant according to claim 35, wherein 
said RhgA SCN sensitive allele is also present in soybean line A3244. 
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37. The method for the production of a soybean plant according to claim 29, wherein 
said first soybean plant having an RhgA SCN resistant allele is selected from the soybean 
maturity group consisting of 000, 00, 0, 1, II, HI, IV, and V. 

38. The method for the production of a soybean plant according to claim 29, wherein 
said first soybean plant having an RhgA SCN resistant allele is is selected from the 
soybean maturity group consisting of VI, VII, VHI, IX, and X. 

39. A method of investigating an RhgA haplotype of a soybean plant comprising: 

(A) isolating nucleic acid molecules from said soybean plant; 

(B) determining the nucleic acid sequence of an RhgA allele or part thereof; and 

(C) comparing the nucleic acid sequence of said RhgA allele or part thereof to a 
reference nucleic acid sequence. 

40. The method of investigating an RhgA haplotype of a soybean plant according to 
claim 39, wherein said determining of said nucleic acid sequence of said RhgA allele or 
part thereof is a determination of a single nucleotide. 

4 1 . The method of investigating an RhgA haplotype of a soybean plant according to 
claim 39, wherein said determining of said nucleic acid sequence of said RhgA allele or 
part thereof is a determination of the nucleic acid sequence of an exon of RhgA. 

42. The method of investigating an RhgA haplotype of a soybean plant according to 
claim 41, wherein said exon is exon 1. 

43. The method of investigating an RhgA haplotype of a soybean plant according to 
claim 41, wherein said exon is exon 2. 

44. The method of investigating an RhgA haplotype of a soybean plant according to 
claim 41, wherein said determination of said nucleic acid sequence of said RhgA allele or 
part thereof is a determination of the nucleic acid sequence of a leucine rich repeat 
domain. 

45. A method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant comprising: 

performing marker assisted selection of said soybean plant with a nucleic acid 
marker, wherein said nucleic acid marker specifically hybridizes with a nucleic acid 
molecule having a first nucleic acid sequence that is physically linked to a second nucleic 
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acid sequence that is located on linkage group A2 of soybean A3244, wherein said second 
nucleic acid sequence is within 500 kb of a third nucleic acid sequence which specifically 
hybridizes with the nucleic acid sequence of SEQ ID NO: 7, complements thereof, or 
fragments thereof having at least 15 nucleotides; and, 

selecting said soybean plant based on said marker assisted selection. 

46. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with a sensitive soybean recurrent parent. 

47. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with an Rhgl SCN resistance soybean 
recurrent parent. 

48. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said introgression of said SCN resistance or 
partial resistance is carried out by backcrossing with an elite soybean recurrent parent. 

49. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said nucleic acid marker is selected from 
the group consisting of SEQ ID NOs: 54-1096 and complements thereof. 

50. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said nucleic acid marker is capable of 
detecting a single nucleotide polymorphism selected from the group consisting of those 
set forth in table 4. 

5 1 . The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said nucleic acid marker is capable of 
detecting a SSR selected from the group consisting of SEQ ID NOs: 54-1096 and 
complements thereof. 

52. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said physically linked second nucleic acid 
sequence is within 100 kb of a nucleic sequence which specifically hybridizes with the 
nucleic acid sequence of SEQ ID NO: 7 or complement thereof. 
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53. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said physically linked second nucleic acid 
sequence is within 50 kb of a nucleic sequence which is capable of specifically 
hybridizing with the nucleic acid sequence of SEQ ID NO: 7, complements thereof, or 
fragments thereof having at least 15 nucleotides. 

54. The method of introgressing SCN resistance or partial SCN resistance into a 
soybean plant according to claim 45, wherein said physically linked second nucleic acid 
sequence is within 25 kb of a nucleic sequence which is capable of specifically 
hybridizing with the nucleic acid sequence of SEQ ID NO: 7, complements thereof, or 
fragments thereof having at least 15 nucleotides. 

55. A substantially purified nucleic acid molecule comprising a nucleic acid sequence 
selected from the group consisting of SEQ ID NOs: 5, 6, 8-23, 28-43, complements 
thereof, and fragments of either. 

56. A substantially purified first nucleic acid molecule with nucleic acid sequence 
which specifically hybridizes to a second nucleic acid molecule having a nucleic acid 
sequence selected from the group consisting of a complement of SEQ ID NOs: 5, 6, 8-23, 
28-43. 

57. A substantially purified nucleic acid molecule comprising a nucleic acid sequence 
selected from the group consisting of SEQ ID NOs: 7, 44-47, and 50-53, complements 
thereof, and fragments of either. 

58. A substantially purified first nucleic acid molecule with nucleic acid sequence 
which specifically hybridizes to a second nucleic acid molecule having a nucleic acid 
sequence selected from the group consisting of a complement of SEQ ID NOs: 50-53. 

59. A substantially purified protein or fragment thereof comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 1097, 1098, and 1100-1115 
and fragments thereof. 

60. A substantially purified protein or fragment thereof comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOs 1099, and 1 1 16-1 1 19 and 
fragments thereof. 
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61. A transformed plant having a nucleic acid molecule which comprises: (A) an 
exogenous promoter region which functions in a plant cell to cause the production of a 
mRNA molecule; (B) a structural nucleic acid molecule encoding a protein or fragment 
thereof comprising an amino acid sequence selected from the group consisting of SEQ ID 
NOs: 1097, 1100, 1098, 1101, 1102-1115; and (C) a 3' non-translated sequence that 
functions in the plant cell to cause termination of transcription and addition of 
polyadenylated ribonucleotides to a 3' end of the mRNA molecule. 

62. The transformed plant according to claim 61, wherein said plant is soybean. 

63. The transformed plant according to claim 62, wherein said plant is soybean is 
selected from the group consisting of PI548402 (Peking), PI200499, A2869, Jack, A2069, 
PI209332 (No:4), PI404166 (Krasnoaarmejkaja), PI404198 (Sun huan do), PI437654 (Er- 
hej-jan), PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), PI548988 
(Pickett), PI84751, PI437654, PI40792, Pyramid, Nathan, AG2201, A3469, AG3901, 
A3904, AG4301, AG4401, AG4501, AG4601, PION9492, PI88788, Dyer, Custer, 
Manokin, and Doles. 

64. The transformed plant according to claim 61, wherein said promoter is an rhgl 
promoter. 

65. The transformed plant according to claim 61, wherein said promoter is an Rhg4 
promoter. 

66. A transformed plant having a nucleic acid molecule which comprises: (A) an 
exogenous promoter region which functions in a plant cell to cause the production of a 
mRNA molecule; (B) a structural nucleic acid molecule encoding a protein or fragment 
thereof comprising an amino acid sequence selected from the group consisting of SEQ ID 
NOs: 1099, 1116-1119; and (C) a 3' non-translated sequence that functions in the plant 
cell to cause termination of transcription and addition of polyadenylated ribonucleotides 
to a 3' end of the mRNA molecule. 

67. The transformed plant according to claim 66, wherein said plant is soybean. 

68. The transformed plant according to claim 67, wherein said plant is soybean is 
selected from the group consisting of PI548402 (Peking), PI437654 (Er-hej-jan), 
PI438489 (Chiquita), PI507354 (Tokei 421), PI548655 (Forrest), PI548988 (Pickett), 
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PI88788, PI404198 (Sun Huan Do), PI404166 (Erasnoaarmejkaja), Hartwig, Manoldn, 
Doles, Dyer, and Custer. 

69. The transformed plant according to claim 66, wherein said promoter is an rhgl 
promoter. 

70. The transformed plant according to claim 66, wherein said promoter is an RhgA 
promoter. 

71. A transgenic seed having a nucleic acid molecule which comprises: (A) an 
exogenous promoter region which functions to cause the production of a mRNA 
molecule; (B) a structural nucleic acid molecule encoding a protein or fragment thereof 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 
1097, 1100, 1098, 1101, 1102-1115; and (C) a 3' non-translated sequence that functions 
to cause termination of transcription and addition of polyadenylated ribonucleotides to a 
3' end of the mRNA molecule. 

72. A transgenic seed having a nucleic acid molecule which comprises: (A) an 
exogenous promoter region which functions to cause the production of a mRNA 
molecule; (B) a structural nucleic acid molecule encoding a protein or fragment thereof 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 
1099, 1116-1119; and (C) a 3' non-translated sequence that functions to cause termination 
of transcription and addition of polyadenylated ribonucleotides to a 3' end of the mRNA 
molecule. 
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gaagtaaaaa ttaatcttat ttgttttaat 

tacaaaattt agttaaaatc aagattaatt 
agttattcac acaattatat ataggctttt 
aaggtatagt ggtccctaac tacgccagtg 
cttaacggag aaaattttta cgtggggttc 
gtatcacctt tttcattcgc aaatatatat 
cggttcagtt gacacttgac aaagttattt 
aaatgacagc taatactgag aactctctta 
gtgaaagact gaaagcagct tccaccgact 
aatgtggaac agataatgac ggggattgta 
accattgaca aagataatcc ttcatattgc 
gataaaattt cagtcaactt aaggccaatc 
ttgcaaaaag ataataagga aattatctat 
ttccttagcc agactcatga tccaaactca 
ttaaggagtt ttcccatttt tcttgatttt 
gcgataccct ttggactcgt tgtgacaata 
cctttcacta taaatacttg ttaatgttat 
aattctttta gcttagtcac gaagaagtta 
atggttcaag aattccatta gacaaagaaa 
agctactaga ccaattgacc aaagaagttt 
gaatatgaga tatttgattc tcattgatac 
cttctgtttc taagatattt ctataaaggg 
taactaaccc actaaataat ctttttttca 
cttttggaaa acaaaagtag agtgataagg 
agtgtgggtg agaacagaaa agagaagcat 
gtgcttaagt tttgaagtac cggaggtata 
atattaataa taattcacaa gagaagaaat 
gttaaatttt attttatttt ttggaatttt 
attaactcat gtatgataac tttttaatag 
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tttatcataa ttttaaaaat aatttaaacg 1080 

cacatcagca tagtctacca gaattttgaa 1140 

tcaccattca gattcaatga tatgaatgga 1200 

aataaagaac cttagaacgg attataactt 1260 

aagaaagtag ataagaagga acgcattcct 1320 

gtaaatagta aaaatttatg gtactgcttg 13 80 

atttgatatg taaaaagaaa tggacatatc 1440 

ttgaagagca ataatttatt gaggacgaaa 1500 

catgcacata tccgatcgaa tgaacaacat 1560 

tttgaagtga tgcaagtgca agcatttagc 1620 

aacggctatg agccttttac cctctccatt 1680 

aaactcacaa atatatgtca agtttgtact 1740 

caaactaaaa taaaattcac aaaatcatca 1800 

tatttttaat taaatttcag tcttctaaca 1860 

aaaaagaaaa tatttcaatc gtctctctta 1920 

ttcataacac tttctcctaa aatctttcga 1980 

ctaaaagtat gaagcattat tttgtaagca 2040 

ttggaagtgt cataattaat ttcatcatgc 2100 

attgtctcag tagttatcat cgataacaca 2160 

cgtcttttaa attcttttcc catcttgttt 2220 

tttttcgata agataaaggt gacattttgg 2280 

aaccaaaaga agcaatttga atcattaatt 2340 

attagcaaaa ctgctgaaag ctcagtccca 2400 

aaaagataaa aataagaaaa tgcaacgaaa 2460 

gcgtcaggat accaccctaa actaaagtat 2520 

caccaaaggc ttagtttcat ttgcattaac 2580 

ataaaataag ataaatcaat tttttcacaa 2640 

tacaagaaat aaaaaaattc ataacttaaa 27 00 

tttttttgtt taattttcct aaaagataag 2760 
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gcgtataaat taattttagt ttacgagaga agttggattt gtttatttct tatattaagt 2820 

gccttgttta gaaatttatt gaaagtgacc caaaatcaat tatatgtgga tttaaacact 2880 

tcaaagctcg ctgaaactga aagcatttca ttcaaaagta aatttaacgt taatgttccc 2940 

ctgattagtt gagaaaagga catgaattgt actagtatta atttacctcg agatcaaaga 3 000 

gagcaaaaag tttaatcata ctatatttac gcagaaaaag atgataaaca agtgtatagt 3060 

acagacgtga gacgtggaat aaaactttga ttagtaataa attggtcatt gttgctttta 3120 

gaaacttgct gtggacacac atgacaaaag ggatttgcat ttttatgaag ctcgtctcat 3180 

gtttaaactt taatttggac ggaattgaaa aacatctcat tgatatattc atgtgctttc 3240 

ttcttgctag cttttggatt gtagatactc tttgattatc ataaaaggaa cgaacgactt 3300 

taatttggaa attgagaccg aagttgcaat agaaatcgtt acacgtcttc tctagtgtgt 3360 

atgtattgtt tccttttggg tttaaaggat gaccaaatgt tgaggccagt cacaatataa 3420 

agggaactca cgaagctggg gaaagtgttt agtatagttt aatgtttaaa ctcatgcacg 3480 

caccagatat cttaagaatc tataccagat atataacggt tgtctatata gtttctttcc 3540 

attcccattg ttggagaatt aaagcttctg attatttagc atattttatt gtataatttt 3 600 

actatatagt attttgtact ttaagtatct tatcaaatct tcggttgcaa catcattgga 3660 

gaagatatag tggcattcat aatgaataca agtccaatga attcacttta tgttgtcaac 3720 

tagtagtcca atgaattatg ttgtggttgg cttaagaatg aatacaagtc aaatttcagc 3780 

tggcttacag tcacagtggc atggaatata taacagagtg ttgcgtcaaa aaaacaataa 3840 

tgacgacaac aggactaaat acaatgcatg tatttttttt ttaattgaga aataatttat 3900 

tctttaaagt tctctaaaaa aagtaacatt ttactaataa ttttccccac tcgaattgag 3960 

ggtattaaag taattttcat tgtttttgct tttatttttt atttaaaaaa catttgttca 4020 

gttaatgaga ttaatgaatt ttgtcacttt tgggtaacac atgtaaaatt gaagacggat 4080 

gcagaaacaa acacctacta aatgacagta tatgaaacct ttatatcaag tacatatatt 4140 

attttctcaa ttttctttaa ctttaactca ttaagccaat cgatctatta catctcttac 4200 

tttgatttga tgtttcactt atcattaatt aaagataaac aatgtattac attttcttaa 4260 

ctaaaaatta gcatctaaga catatatttt tgtagttcat tttgtcatga aaaatatggg 4320 

tacatcatcc gatacaatta ctaaactaaa ctaagggaag aattattctc acttttatat 43 80 

ttgttattta aactatcaat tctgaaattt ctattttgcc cactaaccaa attcctccac 4440 

cccctctctt ttcccthccc tccttctcat atccaaaacc gttgccccta caccaaccta 4500 
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gactgcatca cccctccctc agcgccaccc 

gaccacgcca tccctccctc cgcaacccat 
tccatgcaat tcgttgatcc acacagcacc 
ctcttttttg gattttattg caggttttgt 
tttttacacg ctactgtctg taatggaaat 
ttctttaccc catgtgtgca acgaaaatag 
atattttcgt gggaaggaca agctaaaaaa 
aatagcaata ttggtattgc tagttcccct 
tatgcttaat taaaactttc atacctgaaa 
ttattttttt taatcaagta gttgaaaaaa 
ttttcatcaa gtggtaatat gacatacgga 
actcatactc atgttatcat cttcgatgat 
ataaaaaaat tttagataac tatttgacaa 
taatttattt tccaggtatg aaaaacttaa 
taacaaatga taaggagaaa ttttataaaa 
aattttatta tttcattaaa aataaaaata 
taaaagtaaa tcttataagt caataagaaa 
aaatatttgt aagtttgtgt aaaaaactaa 
atgtaattta cttttatata gacttaaaaa 
tcaaaataag aaatacatta aattaatgag 
tactatttga aattcacaat aacacttatc 
ttaataaaat aatattaaag attaaattaa 
gatcaaacac tgagatatat attttggatc 
gcataacatg ttacatacat tgaaccaaca 
tgtttaaata aatttttaat acctgaaaaa 
tcattttttt caaataccta ctagaaaaaa 
tcataatcac gtccactcgg tcatatgtca 
cgtgacaagg tgtgatgacg taacattttg 
ctaacaacaa ataataaaat aaaattgaat 
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ttctccccac acaaaacgtc gtcaatccac 4560 

tgtcaatcca catgcatgcc atcctccccc 4620 

ttgccccatc aagatcaagc acccttgttt 4680 

tgttgttgaa tcgtatttct gttgcatcat 4740 

agaatgaaat cgtatttctt ttgctttttt 4800 

aattttgtta gagacttatt taacggagtc 4860 

aaaatatttg aaactaagag ggattgtgcc 492 0 

gaactacata aaaccttcct cgtattgaaa 4980 

tatgggtgtc ttcaaattat tagctaatat 5040 

aaattcagtt tcattttatt atttatttat 5100 

gtgttgcctc aacctgtctt gttatggata 5160 

tgtgacaata acatgttggt aaattgaaac 5220 

aaaatgaatt tttaggtagt aatttgaaaa 52 80 

ttaaatcaat ctgaaaaata attttttagc 5340 

ataaataatt taatttcaaa taaaatgata 5400 

aaaatggtaa tcataaattt aagttatgtt 5460 

aaattatttc tcaaacactt ttatttgatc 5520 

aaattaatta aaataacctg atgagcatat 5580 

actttatctt tttttaagat aatttctcat 5640 

atattttttt atttgtagaa ttaaaaaaag 5700 

aatttatcat atttaatcag ctgaactaaa 5760 

aaagttgagt tacatacatt gaaccaacaa 5820 

cagagaataa taaaggaaga gtgaaggaag 5880 

agatcaaaca gtgagatatt aattatttat 5940 

catatcattt ttaaattatt acttaaaaaa 6000 

aaatttagtt ttattttatt atgtcattaa 6060 

ttgaaagtga taacgtaaca ataaaatatt 6120 

tatgtcacat tatcatttca ctaaagacaa 6180 

tttttttaaa atacttaact ttaaaaaata 6240 
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aatattaaat aataatctaa aaatgatctg 

gaaatataac cttcttattc agtcattgtg 
tatagtaatt agctctctta gctagaaaca 
gcacaaaacg acaaaatgat ttcatgattt 
ttcagagcaa ttaaatgaac gcaattacga 
ctattggcta tttaaaattt cactttgaca 
gcatgacaat tagcattaaa ggttagctaa 
aaaaaaactg agaaacatga tttaagtttt 
aaaaaactta cacccttcaa atgtactaga 
aattaaatgt catactcaac agtcaacacg 
aaatcaacca agtaacataa attagttgac 
atgggataac accaggcata ttaatttaca 
agataatggt ccagtacgtg ttaggaatct 
cattggatgg tgtaagatct aaaatactta 
tatttggatg tacatgtaat aactctatct 
gaacgtgtct gtctcagtaa agggcagaca 
gccaatgtac ccggcacgat gaaatcatcc 
ctcttccagc aacagctgga gcaactgcat 
gcaatgggtt tgatgaagtc tcatttggag 
cctgacctac agatgcatgt agactgtata 
agaacagtgt taaggtagaa aaaagccttt 
ccaagaactc acaagaagtc agcttcattg 
aatatattag cttataaact caaagtacta 
cttcttgttt cccctatcaa ataccaaacg 
aatttattag ttttaagatc aaagtatctt 
agtagacttt gataaggaaa atatttaact 
ttggctgtat ggctaaacaa aatccaatga 
atttagctac aagccgacaa ctattcttat 
atgtcaaaat tttagtccag gatcacttaa 
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tatgtccatt atcaaaaatt taagtaagcc 63 00 

gattcgctaa caactcgtgc tgatcgagcc 63 60 

ttagcccctg attatcaata tgatgttctt 6420 

ttagataaat aagcgcagga ttctctatta 6480 

ggcagcaaat gcaaaatgct tcacatgagg 6540 

tttttataat taattttgtc tcaataattc 6600 

aactctaatt taaattgtta caatatttta 6660 

gttattatat caaatactac tataaaatag 672 0 

ttcggatttt ttttatagaa aatattttaa 6780 

gtatttcaca gtcccctaga atttcaaaca 6840 

tgaaaaaatg aattaaatta aaaggcagtc 6900 

taacctcacc cgagcaaaag cgggtttaac 6960 

aacatgctgg caatgtcaaa aaaataacag 7020 

caaagctagg aggaggacaa aatggataca 7080 

agacaggcta gttgagatac ataagaataa 7140 

caagtagaag tagaagaaac aaatagcagt 7200 

gagatggagc agccgaaggt ttgtggggag 7260 

gcccgttcgt tctttgttgc tcatctgtag 7320 

aaaggggcat cctcttgcct agactcagat 73 80 

taagcaaaag gaataaaaag ggagacggga 7440 

gcatcaagca ccaggcaaat ggttaagaga 7 500 

cctaagtaga atgattagaa ctaaagctaa 7560 

tgactcacaa tttgagcgtg accacgctag 7620 

gtatcctgtc atgaagttct ctgccaaaaa 7680 

ttaataccat attccagagt atgggtaatc 7740 

tacgtccgat tgttcccata tctccttcag 7 800 

ctccctgcac aggaaggacc gatctaacta 7860 

aatgaagtct cttgtaacat ctttaattat 7920 

ttcacaccca cctcattgcc atagaataca 7980 
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aatacggggt catagaccac aaaactgtta ttctgttgga acgtgagagt taagctggga 8040 

acctttggca actcttgcga actgtctcag aaaagaataa taaataaata aagccatcaa 8100 

agagaccaga aaattctacc aattaggaaa tcatgcacca acgcaagagg gaagagagac 8160 

agagatctat ccagaaaact cacctgggaa cataacaata ctcccaagga gatccttcaa 8220 

agctagatct tgaaccattt acttgttgat caaactatca caaataaaat catattaaaa 8280 

gactgagcag aaatttacat taatggaatt tagaaaacta aagtacctcc tcagctattg 8340 

ctccatacac atgaccagga agaaaagtaa atgatgtccc gctatcaacc tgcactttaa 8400 

aacttgtcat tttaagacaa gaattcccaa cacaacatga ctccactcca ataatgtagg 8460 

ttgaactgac caggaaattt atcataatta gcactgtgac accattttct aaagtatatt 8520 

ttagtgcagt gaaacattgt aaactaattt aaagtacaga atttcatact ataatccatc 8580 

caaaggcaag aatgaagtag actgttggat ggttggtccc tggtccccaa aaaatattct 8640 

accagaatca tcttcattaa agcacaagga aaaagaatcg tggattaatc ctgatttagc 8700 

aagaaaactt ggaactgaac tctccccagg ccccaaacct aataggccat ctggagcaac 8760 

cccatccaaa taaccaccac tttgcttcat accacacctg cattcaacaa aactgacgtc 8820 

agagaaaaaa ccaggaaact tgtttttaat taaaaaaaga atacatcatc agggaaagtg 8880 

atgttgaagt aacaaagaca gggcagtcag ccacacccaa gaacaactgg agcctgaaca 8940 

gatgagtttg ataaactgcc gcctgactga agatgcagta tgtcctcaac caacaatcca 9000 

gaactcgacg tgttctccga caagtaactg accatgtatg gacactgctg ctgtgaactt 9060 

ttacaattcg aacccttatc acacaactga tgactgcaag atagatgctt actggataag 9120 

gaccgagacg gactatactc attcagatct ctatcctgca taatagatgc cccatatgtg 9180 

aaggagcacc tgccaatcaa ttactggaaa taagtgctaa acctgttata gattctaaac 9240 

ctcattgaac cattacactc agaaaggaac catgttttgt gttagtgtca cattaaactc 9300 

gaattgaaac cttatccaat caaggatttg caattcgcat gattaactat ttgttaaaca 93 60 

atcaataaaa caagctaata taatccgata ttttattatt tttattacat ttaagatatt 9420 

gagactacaa gttacatagt agagtaaacc aacattttag ttcctgaaag tataaagcct 9480 

agtcacataa acattagtcc ccaaaactaa gaaacttcaa aaaagtccct gaagctgcaa 9540 

tccgccaatc gcattaatcc aaagttataa aaaaatatgt gacttaatga taatattatc 9600 

atatgtttaa gagaccaaaa tctcaagatg aaacaactag aaactccctg gactaatttt 9660 

aaattttcct tagtttgaag aactaatgtg acacctcgtt atgcttaatc atagtttact 9720 
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ctacatagta agagaaatca aagaaaaaaa 

gaatagtaac tcgatgacaa tggagcacac 
tcactccctg cgtccaacgc cacgagaaac 
gtgtaatgca acctaatcac cagagacgca 
aattagggat tattattatt attattatta 
catcgttttg ctaccgtgag aaggaaacag 
cttgcgccgg agaatgtcgc cggtgagaag 
cggccagtat ccggtcggag gccgaaccgg 
gcgggcggaa aacgtgatcg gaaccggcat 
cagcaaccgc caccgcatcg agatcgagat 
tttaactgat ttaagtaacg attagtgtta 
catcatcgcc atggatcgta tcgtttcgtc 
agtgagagtg agggtggata aaacaaacaa 
attagactgt tactaagtgc ttaattaatg 
taacagtaag tgattattgt aaatgatgat 
acgaagcgaa acgtcacgcg cggtggcccc 
ggccttttat tgccgatgtc gatttgtctt 
tattcctttc cttttacgaa ataaaaaata 
ttaagtattt agtttcatta tataaataaa 
ttatgtggtg tgggtcagga ataaagcttc 
cagctgaaca cattaattgt ttttaaagaa 
catgataatt tgactaatta atacttttag 
taattataat aataattaat aaaataaata 
' taatattagt atgatctcaa ctcaactgat 
ctgtggaatc ttaagttctt tctccaaaga 
gtgttctctt attctgccat ctccaattct 
tgtttaatga taaagatcaa atacgtttgc 
caataggtga agctaaccaa tgcacaagtg 
ggggtggtcc aaaatgtgaa ggtaagttta 
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tagattaggt atgatattca taccagattg 9780 

tgtacgcaat cacacggaat ccaaaggaga 9840 

gaagtgctcg gtgttcctat atcaatccac 9900 

ttatcagatt caaaccgaag aaaagggagc 9960 

ttattaccag ccgaagtcgt tgccgagcga 10020 

tagctggtag cgtgcgcctc cgactttgat 10080 

catccggtag tagcccatgc tccaccggtc 10140 

tttcatttcg tcggcgaagc ggtgaacgag 102 00 

tgcgcgagcc gttaccaaaa gcaacaaaag 10260 

ctggcacttg cacttattct gatgcctcgt 10320 

attagtgagg tgagggtgcg cagtgtgcat 103 80 

cctgtgtggc tgtgtgtgag tgagagtgag 10440 

acaaaactag cgcattttgt tgcgggtgga 10500 

gggaaaggaa agtggtatga ttagtgtttg 10560 

taggaggaat aagggtgcaa cactgcagcg 1062 0 

accatgtctt tacgtgcttg agaatgaaac 10680 

tgccactgtg ggccccccca catttattat 10740 

aaaaatcaaa caaacaaggc aaaaggtttc 10800 

ataaatgcct agatctagta aataatcaca 10860 

acacacgaaa aaagaaatct tgcaagtaaa 1092 0 

atctaaagtt attgaagaaa acaactgaga 10980 

tgaaggagac gtattttaaa agataaagta 11040 

acgattaata tttagtaatt tcattctatg 11100 

aattttcaag ataatagtta taattgcact 11160 

aaaaaaaaaa cattttttct tc'cccttgtc 11220 

gttcacaatc gtaggttgtg ccgccaatga 112 80 

aatgaatcgg gatgacaaga ctgagacaac 11340 

ctccaatcaa taaaacaggc ccaaaaaggt 11400 

agtagggtgt tcacgccttg gattgcgtct 11460 
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gtgtaaatcc gtcacccaat ccaaacaaaa 

ttaaatcgac ctaatctgat catgaatgaa 
agttcaaaaa taattttctt aaatttttta 
cttgtaatat agttgcataa aaaaaattaa 
tcataaaatc aaattgaaaa caagtaacca 
ctaaatcaaa tttcaaccat aaagcagata 
taaagtacac actagtacaa aataaactta 
atagaaacac tgcaatatag tgataatgtc 
atagaaacac ggtaagcaaa agatcaaaat 
atgctatgca gaaaaaagaa atatgccaaa 
atattacctt aagaactaat agtcctaact 
agtagtaatc ctaacactaa cattatttaa 
taaaaagttc atcataacat aatatcaatt 
aaaaagaaac tattattgaa tacctagttc 
taaatcaccg acattttgct tattagtttt 
aacagtaatg ggacttaaag aactacaaaa 
atgcagactc aaatgacaca atagacataa 
aaataacatg atatttggat gctttcattt 
cgtcatcttc attgtcatcc tttaaataca 
attttttatt cattttcaat ctaaattggt 
tggcattacc ttgtgaagca tggtatgaag 
aaataggatg ttctgaagca tattcaacaa 
tcagcatctc tttggtttgg tcaacaccat 
caaatttata acacagatca agaaaagcag 
tctctcaata cttgttaaac ttgagttgca 
atcctcatgc ctccatctat ttaggcattt 
cttagctgta acatgtagtg acccagaaaa 
actcacaaac acacgagcat gtttctaatc 
tagaagagtg agcacatatg cagcctcaac 
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aatattggat ggatttgtgt gtttttcttt 11520 

tttgatcgag atggatttgt tattaaaaaa 11580 

aaatattttt tagaatttac aatacaatta 11640 

ccaccaattt caatgcacat attaactgca 11700 

acaaacattt aatttataaa gcaaataata 11760 

acaaattgtc ttgaaaactt agtaatctta 11820 

aaatcatccc aaaaaatata taatactaca 11880 

agacaattgc tcaaccagcc aacctcacac 11940 

caattattat actaataata aatttaaatt 12000 

aaagaaatca tatcataaac taagttaaaa 12060 

cccaatacta atactcctaa gaatagtcca 12120 

agtcaaacca tacaacttta aaaaatgttt 12180 

tatattcata ttgtaaacaa acggaaaaaa 12240 

catctttttt gtttcatcta attcaactcg 12300 

gagtcaattt tgggtacaaa tcaaagcttc 12360 

atggatcaag cactcaacct tttgtactaa 12420 

gaatgaccaa' tatatctcta gccatgaaag 12480 

tccaccatgc caaaatgtca aatccaagac 12540 

tatccaactc actcctttgc tattcaccac 12600 

cgtcccaatc ctcatcctca tcaacatcgt 12660 

ccaaagtact agaattacta ctatcaatgg 12720 

acatttttct tataagatca tccaattttt 127 80 

gcattttctt aaaacaaaac tcaatataat 12840 

tcacaaataa aagatagcta atctgatcac 12900 

tattagttgt ctcttttgta tcaccggatc 12960 

ctgaatagta accaacttct taaagaaatt 13 020 

aaaaattgca tcatagaaaa ctttcaaaaa 13080 

catctcttta ggacatcctc cttcactatt 13140 

atactcataa cgattgaaag cttgttcaaa 13200 
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ttttcagcaa catctaacat caaataagtg 

gaaaactttc aaaaaactca caaacacacg 
tcctccttca ctatttagaa gagtgagcac 
gaaagcttgt tcaaattttc agcaacatct 
ggcacattaa gtgttagcat tgcctttgaa 
aaagtagtca acctacttgg agaaaacctc 
attgaagagt caatttcctt aaacccatca 
aactcattgg caagtggact aaattgtcac 
tcaagtccac aggagcttta tttgtactta 
agataaagaa ttgagagtga aaagttaaag 
aaacaataat ttgaatgcaa aacatgactt 
ccaaaactac ttgtgctaca atgttaatat 
tactcgcatt tactatgata ttctatcttt 
attttttctc ccaaattcct ttgcaaagat 
gtatgaaata agctaaacaa acatcattct 
tttcccaatt ctctaggaaa caaccatttt 
gcatgggtga ttaaaccaca atcaataata 
aattgcatta aataaatagt aaggaagagt 
tcaacaatga ggggtttagc ctctaccatg 
gtaaaagaag aaaagggatg aataaagaaa 
tttcccctat tagagatgtt caggctttgg 
tgtgtctttc tccttttgct taaggtagct 
cccttctggg cttagcaggt atggcggtga 
tagcgagtat ggtagcaagc acgcacttag 
cttcttcatg aatcttcttc acgctaagct 
aagcctttct tgtatgctaa acaagctgtc 
cttcacgttt ttgcatcaat tttttctccg 
ctgatcaaga atcaaaatga tattaaaatc 
tagaggaatt cttagtcaaa attgactatc 
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gagtgaccca gaaaaaaaaa ttgcatcata 13260 

agcacgtttc taatccatct ctttaggaca 13320 

atatgcagcc tcaacatact cataacgatt 13380 

aacatcaaat aagtggagtt ccatctggtt 13440 

tttacactaa cgtcctccgc acacctcttt 13500 

acaaacttac atgcaacctt tattttacta 13 560 

catacaatca aattcaaaat gtgtgttgtt 13 620 

aagtaataaa ataaaataaa agtctgagta 13 680 

aattaatgta aacccaaatt ttcaaacaag 13740 

ggctagtaaa aggcaagaag ataagaaaat 13 800 

caaaatgcaa ttgtgttgaa acttagcatg 13 860 

tttttctcta tctaatatta tccccgtgtt 13920 

ggaaccccgc atcaagagcc taatttatct 13980 

aaaatagtaa atcatattaa gattaaagat 14040 

attcctagac atgatttcat ttagatgttc 14100 

caaatgcatt atctcctaaa caatacafcga 14160 

ataaagcaca taaaagaaca atagaactga 14220 

tacattacaa gagtttggtc tatcaggctc 142 80 

agagacttta cactttaagg gttgatggag 14340 

gagaaggaga gaatggttat agaaggaggg 14400 

gtgtattcaa gagagagttc taagtttcgg 14460 

taaaattcac acgatctcac gctaagagcg 14520 

tcacgcacta agtgcgacgg catctgggct 14580 

cgcgagattc acactaaacg cgcctttggg 14640 

tgatctgact actaagtgag acgacgcgct 14700 

tccatcttca attttttttc aagacttttt 14760 

gcacttgtaa tttccttctt ttgaatcttg 1482 0 

ctcattattt cattgaaaac aacagtaaag 14880 

aattaaaccc aaatttccca gttatcatgt 14940 
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gctgaacgct gcatatgtat aaactcttca 

agacctctac ttaaatatga aatggctaca 
caacaaattt tttgaatccc tcattctttc 
tcacctttgt ggtcaccaat caaacaaaat 
tcatcaatgt aataagttgt cacacacatg 
tcaattgtaa gtgaaaccat tgtttatttg 
tattaaacac ttgaatgcaa tatctagcaa 
gttgagccac ttccataaac ttcctaaacc 
catttgtaat gatcatcttt gcaagttcta 
atttcacact aagagcatta gggtcaccct 
tatttctcat caacctgcct aaagggtttt 
ttttggttcc atatttagaa ctattagcct 
catcattatt taccctatca aaatgattcc 
cgacccttgt tgtgtgggaa cctccagcac 
gctgctacag gttcagtagg ggacattgag 
cattgacact tccaataaaa cttctatttt 
gtcaactaca agtctacaac gtgtatcata 
actatatatt ttttcttaag tgaataacat 
gcataattat taatacaact atatattttc 
gagttgtaca aatacttact tagtagtaga 
ttattccaca aatccaaatt ccaaattcat 
ccttgtcatt tgatgagtat gctttgattt 
gtattcattg aaactcttga attagaaaag 
ttcatatcct tatgtgatcc aaattttctt 
tctctttatt tctacatcaa ccaagaatgt 
acactgttcc ccacagtgat agttcctaaa 
aaaaccacat ttataacaag caatacaaac 
tgctgtaaga gaatatcagg aaggtaatca 
aaatatctcc acctagagag tataagtaca 
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ttcaacaaag tatgatcatt tcaaacacta 15000 

ctgttagcac ttgcattatc aactgttata 15060 

aggcaattct ctagggctat ccaagtggtt 15120 

tttagtattt tcttatgcaa cctaaatact 15180 

taattcacat tttggattaa tgtccatgta 15240 

caaacaatat gcattccaac tgcttttcat 153 00 

tagtcacatg agaagaaata ttaaaataag 153 60 

tttcatattc aacaaattta aaagaaagtt 15420 

taatagttct ctcttgattg aaatcaacaa 15480 

cacatttctt tccaacaata attgtttttt 15540 

tcatacatga agctaaatgt ttgttcaagt 15600 

tatagcattt tttacaatac ttgcaaaagg 15660 

acacttcaga aggggatcaa atagagcttg 15720 

ttgttttgtt ttgattgtga tggagcacca 15780 

agggagtaaa cggtggcaaa ggacttaaga 15840 

ccatttgtca aatttaataa tttaacaacc 15900 

taaaaaataa tgaataataa tataaataca 15960 

ctacaaatta cactcaaatg agtcacttaa 16020 

ttatgtggca agaacattat actttgttta 16080 

caagtgacta ggaattttat taccaagaac 16140 

gttctaacct ctaacaacta tatatatatt 162 00 

cctatggaat ctcatctttg tctctaattt 16260 

ccacatgtac ctagtatcat ttgatcataa 16320 

tgctaataat gttatttaaa acatgtacta 163 80 

atttcttgat attcatacca gatttttgtc 16440 

tataattttt tatgagtcga agacaaataa 16500 

aaagcagtca caatatagca atctatcata 16560 

taaaatgaga taaaggctga ataatttgta 16620 

gattttaata accacatcaa aagaataaaa 16680 
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aaaaaattat gtttcagaca cattcaccac 

cgattcatat gtaccaatta ttaatacaat 
tctaccagtg ccatgtagac gtaggagagt 
tagcgcagtc gcgcagagag attgagagag 
cggagcagag acctgagaaa ggacgcagtc 
ggcggagcga gcgcaacggc gcggagggac 
agcggagacc ttagcgctgc gacgacggtg 
ggagggccgg accgtcggag ggagacgcgg 
ggcgatatcg cgataggatt tatcctctgg 
atttttttta taatattaat ttattatata 
ccgtcccgaa tatgattagg ttttataagg 
ttatccaaat taattgggtt agattggatc 
ataatcgatg gaatttgaca attcatgaag 
tttctttttt ttattttgta aaaaaaacat 
taatataaaa tataaaaaaa tgttagcaac 
gttaaaaatt attaaaaact acaaaaatta 
ctcacaaaaa tttataaatt ttaataaatt 
ggatatagag attaatgagg ggttttctaa 
aggtctaaat ttatagtcct catgtaaatc 
tctattagag ctatctatat aatacggtgg 
acaatgacta ggactctcgc aaatattggt 
gaacgagagg taatatatga tccatccatg 
tgctcatatc aaagggggag aggggtgacg 
atgttttgat ttttgtgata tttttttaac 
tattttttta ggacgaggaa ggtgctgtaa 
caaaaccaat gctttcggct tctgaaatac 
gttatgaaga gtttgattgg aaaatcaatg 
taattaaaat tggtgaaata ataaaatcta 
aaacgaatcc ccaacaaacg atatgaattt 
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agfcgatggtt tcttaaaaaa aatcatctat 16740 

aatgcagtta aatatgcagt aagcattttg 16800 

gacagagagg ttgcggcagc atcgccggcg 16860 

gatggagatg cgaagcgcca aagagaaagg 16920 

acgcagtcgc acaaccgcga gggaggaaga 16980 

ggagatgcga agcatcagag agcgaggtgg 17040 

aacaaagagg ccgagtgcag tggcgcggaa 17100 

agtcggagta gtcgagtaga catagtgggg 17160 

gaggaataat taattaattg gtaatatatt 17220 

aaggtttcat cggactcggt attatgtaac 172 80 

ttggattagg tttgattcaa taaaaaaaac 17340 

attggattat ttgtacataa gtttaagaac 17400 

ggtatttagg caatgtttgt ctaggttttc 17460 

tttgcttaat attttttcct taatgattag 17520 

acactctttt taacacatat ttttccattg 17580 

gaagaaaaaa ttattaaata agaaatgaga 17640 

ttaactaata atagtgttat taacatttct 17700 

tcttttcttt attaaaaaaa tatgatgtaa 177 60 

acgtgtgtgt atttggaaaa tattcaagta 1782 0 

agtggagttg agtcatatag cctctcacct 17 880 

gcgcaaactc cttgcttctg tatttttaga 17940 

agtatgcgaa ttgggaatga tgcatgatt't 18000 

actaatatat cttatagtta aatagataaa 18060 

aataatcgtt gaatatcaaa aacttattat 18120 

atgaaaagga agtaataaat tgaagtttat 18180 

gagatctaca taaagacaat tctaatgaat 18240 

gcaaaaggaa gaaaatttat taactcgatg 18300 

tattaaaaaa aaattacgca atagctcttg 18360 

catgtagaaa tttggtactt tgatacagaa 18420 
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agaggcaaaa tgaaatgatt ttgtcatttt 

caaattctga attgccaaat tttgggaaaa 
gtggtgatct tgaatgttca ttaagagcta 
aaaaaatctt aaatatactc ctataaatat 
aacgtgcaaa aatcatacag gattcgtagc 
taacctaggg aaccaaggat caagttacaa 
gactatatag accaaagaac aaataactgt 
aaaatttaga caaaaatttc ttttcaacca 
actaaagaca gaaacgggta tcacttgtta 
gagggtccac ttttactttc cggcaacaga 
actacccgat caatctggat tgtcactttc 
aaatagatgg gttaagcgga agcctaaaac 
tacaagtcca atttttccac aattaccttt 
tgaagctttt ggccttttgg aggactctca 
ggattagggc cagttgagac cacctgatca 
tttccgagta ggaaatcaca tcaaaataga 
ttgtgataac cacagattca gaatgcaaat 
agaaattgaa gtgggtgtgg ctcttggcaa 
gtaacaaatc gcatttaagt caaaagctac 
gaatctggat tctaaagtct ggcaacagaa 
tttttttttc taggaagtag ctataaaggt 
ggcataattt cgataaggac atagcagata 
gagatacagt catatattag tctcacataa 
tagtgcacta gtggagaaga agatcacaaa 
aataccttgg tttgccttgg aggagtgttt 
tttccaacag attgcttcat gttgttaccg 
aaacactgca ataaaaattc tgccttctcc 
tactgcagca aaggtatagc atctgcagct 
ggacatgctg accaagcttg cctaagcagg 
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gagctttgat ttattagtag aaaaccgttc 18480 

tggtaatcca gattgatctt ttgtgtgagt 18540 

atcattaata gtatttattt atcacacaca 18600 

ttatatataa acaaggcact ttaaattgac 18660 

agacctaatc ttcgtgtgcc cctaggcatt 18720 

aagaatatta ttttaaaaga atttgtttgg 18780 

ggctatgtac actagcaaga taaataatac 18840 

tgtatcctgt tttccaattt caaagctgtc 18900 

gaccactgaa attctatttc taaattcctc 18960 

gtgtactctc cagatactgc acctagcatt 19020 

ccaaatgaac tctgtggtca gataaaaaaa 19080 

caaaataact gaagatggat agactgcatt 19140 

cccattttgc tcttgttttt gcaagaaatg 19200 

aaggaccatg taaagctctc atcccactcc 19260 

gcacattctc atggtaatta gcacattcac 19320 

aagaaaacat gattttccac gaattgtata 19380 

cattgtccct tgcgtttaaa acaatagaaa 19440 

gtgagaacca tatacttccg tatcaatatt 19500 

ttcaaaagga aagaaaaaaa caatttgggt 19560 

ctgcaatgga atatgtaata gtcaactgga 19620 

ctatatccta attgggcatt ttctaatatt 19680 

acattattta ttatgcaaaa gattgacatg 19740 

aacaagcacg tcatccattt caactaatcc 19800 

gcaagataat aaatacaaaa ttatttttaa 19860 

ccgagtgtaa gcttgcaaaa aacacttggg 19920 

cgcttgatga tcaccaccaa tgttcctggc 19980 

tgaaagcgag gtgggccaga ctggattaag 20040 

gctattgact gagctcttga aacttctgct 20100 

aaaagtgcat tcaaggcagc ttcctgagta 20160 
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gcctcagagc ctgtctttaa ggacgtaacc aaatggggaa tgctaagtgt tgctggttca 20220 

gttgctctca gtcttgggaa gttgctaaat agagaattta aggctttaag atactcatca 20280 

ttcacagatc cagtagccca taagtccttt tcaatagcag ctaaaagaac agacacaaaa 2 0340 

aatgttaggc atatttacat actgatgaga gaggttatca aggtgaagca acttattgtc 20400 

agacagacag aatcaacatt gctcaccaat gaattcagat tagaaatagc acctttatct 20460 

acagacaaga ctttcaaatt caacctatac tttctctgat acattaatga atatttacag 20520 

aagactaggc gcctatttta gctagaagag gttatataga acaactaaga gtcaagctgt 20580 

tcaacttatg aagcaattaa gtgcactaca agttttacta ttcatttgtt ccttccccaa 2 0640 

atttttggtt tcattaccaa taatcaacca aaaagaatca gctctgctat ttttccaaaa 20700 

gatgtattaa aagtcacatg tagcattttt gttttttttt tttattgaaa atcaagttat 207 60 

gcttctgctt ataatagcaa agaaaaatgc tttactatgt ctatgactat gagaatcaat 2 0820 

tatttgattc aagaacaaga aaacaacatc agaaattatc atgtgaaaag taacaaatac 2 0880 

accattctat atgatgaaaa gcgaagataa gatataccgt agtggttcag agtaaaaata 2 0940 

acaagtattt aacactgcac gataggaaaa taactcacca gttattgctc tgactgtttc 21000 

actagaagca tactcttgaa tggtatgatt tgaaaataga agtttaataa acattgcagc 21060 

ctgaacagat gtttcaggat cacttgaacc tatcagatcc agtataacct gaaccccacc 21120 

cgcctctgca actgctcttt tatttgatcg actgtacatc acaaggtttt gcagagcaca 21180 

tatggctaca actttcattt cttcagttgg ttggtcttca agcacattca ctaaagcacg 21240 

acaagctgaa actgcatcac ttgtccgagc aagaccctca ttctgaaata gatcaccaag 21300 

agccaaagtt gctaatagcc ttgcttgttg tgcttgggtt tgtgggtcca agagataatg 21360 

ggataatggt aagatggctg acttggtaac ttttgtttct ctgatcttta cattgttaag 21420 

caaaacttct aagagtcttg cagcagtttc ctcgcactga tgagatctta ggagctccaa 21480 

tagagcctct atagcaccac tttcagccat tgcttcagcg ctagttccgt catcactttc 21540 

caacacaaga agagcattta atgcaccaac aactgtgctt tctaagccag accgaagcag 21600 

cctaaccaaa acagcaactg gaacttccaa gtagtattca gaactaaatt gtagaatact 21660 

agccaaaaca gaggcagcag actcccataa agcatgagga atggaaggat cagattgcaa 21720 

tataactttt gaaatttcaa taacaccacc ttcttttgca atttcatttg gccatatcaa 21780 

tgcaatactg accagggcct ttacagctct ttgctgcaat atatgtatac cagaaccaag 21840 

aactcgtata agagggccaa ttacctgttg cgtcaccgga tctttctgaa ggtgttcttc 21900 
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taaaagtaga tgcgagagaa gctcagcagc caactgttgc actgctgata ttggagaatc 21960 

aagcaaaggg ataagaggtt caataacttg gtgacaagtc aatgtgtaat cagcacgaca 22020 

ctgcggatgt tctaagatat taactaaaac ctgcaatgca ctatgttgtc catctggccc 22080 

aaattcttct cttgtcagca acataaacag gggttcaact actttagcag ccgatggtcc 22140 

tttagctatg ctagcattat tggtcaatat acgcaacaat tctgcaaaag ctgcacatag 22200 

ataatcaggt gcttcatgaa ggatgtcaag tatgctttca ataactccag ctttcaccat 22260 

ttccatttta caagctggcc tgtcttttcc taacttgacc agagctctgg aaatggcctc 2232 0 

atgaagtaca tgattcctac cataaagtag gccaacaaga ggaataactg caccatgtgc 223 80 

agcaactaat tcagccagtt gctcatcatc aacaagccta tccaatgcac ggacaactga 22440 

atggtgagca ggactaaact cacttacaag gagagagacc agaggttcaa cacaatgtgc 22500 

agcagccatt gtagaccgaa tccttgtatt tccaaaaaga acagaacata attcagcagc 22560 

atcccctttg aggtccattg agcaatctga tgagagaatc ctgcaaagaa catccactgc 22620 

attcatctca acatctgcaa cagcaagtgc ttttgaaggg ttttcactca gtaacctaac 22680 

caatgcagca attgcagcat gctgctctct ctctaaacca gtattaagaa tctccaccaa 22740 

gggttgaaca gcttgtcgag cagtctctgc atttctaata tggtcagcag agaataaact 22800 

ttccaaagct tttgcagccc tataccttgc agctcttcct cctaaacgca agacagctac 22860 

aagttgagta acagctccaa atgctgactc atgtctccga atttctgcac tgctaaacag 2292 0 

aattcctaac aaatctgtag cagcttcttc cgttgcatct tgcggaccta gtgaaagata 22980 

cttggaaagt gcctctaggg cccctgcctc aaccattaaa atcttatttg atggacaatc 23040 

tatagaaagc tgagtcaaaa gcccaagtgc taaaaatggt gctcctgggc gttctggaat 23100 

tggtttgagc agatcaacta gagcaggtat tgccttccga gaagtggcac caactcttat 23160 

gtcatcaact ctaaacaacc tctcaagagc aacttgatca ggataatgca ccaaagaaaa 23220 

ttcatctgac aattccagaa gatcctgtat atcagaatca gcacagccaa gcaaggagat 23280 

aagtccacct gctgccccag aatttgccac agacagaagt gttcccctgc taccattaca 23340 

gactaggcta gctattgatt gtgcagcaaa atatctgttc gctgattcct ctgacttcaa 23400 

taagttggca agggctggta tagatttcat ggttgcatgc gctcgtatga tatccctatc 23460 

ttgaaataat actgccaaca acaaagcaca aatccacata ctgctatctt ctttataatc 23520 

aatctgcata tcatatcacc ataaaggagc aaagatttac acacagaaat aatagttata 23580 

tactaatctg ctatctaact actatgtcaa tgaggagggg tgtatttgta aacaactttg 23640 



14 



WO 01/51627 PCT7US01/00552 

aaagatatgg atgcgtctat gcaaataagt aaatatttac tatggaaaga atcagatacc 23700 

actgtctatt tcaaattgac tgaaaacatg cactagaata cctgactata ttgtgagaaa 23760 

caatctgcaa tcctgtcagt gagaacctca attgctccag cctccattat tgcaatttta 23820 

cttttttcat catggcaagc aagaacagac agtaaccata tagctaagtt ggcactggaa 23 880 

ataagagcgg tgccagtgct tgatttacaa tcattggctt cttttgtatg cctgcaaatg 23940 

ctgataactt ccctgctgtc atcaccctga ttatccaatg tagcctgtga aaaaataagc 24000 

atatcgacca gagactgaac aagattagca cacaagtttg ataaattgag gtcctccacc 24060 

agcttctgafc gatttacttt agcagcacaa ataagaacag cagctcctcc aattttgacc 24120 

tttacatttt tagaagtgga attgattatc cttttagcta tagaagatat acatccagag 24180 

gcagtaacaa cagtgtctcc cagaacaaaa ggctgatcct tacataatcg agataatatt 24240 

tcaatagctt tatcctgcaa cactggcgtt gaatcagcaa tagacaaaac tattgggatt 243 00 

atgcttttag gaaattcagc taaaactgcc caagcaggtt tactgtgtgc actggtcaaa 243 60 

tctgacctgg aaagcatggc aagtgcttca agagcttctg atgtggaact atgttcatca 24420 

atagcaaagt ctaagaaaga aactaatgca agaacagtgc cagctcgatt cacacaatca 24480 

gttacagaah aatcaacttg cctcttagaa tgtaagaggc gagcaattgc tgcagcagca 24540 

tgtgtttttc cagaaattgt gccttcacgt aatactctgg tagcagccaa gatgacttct 24600 

tctgcaacag ctttctcagc aatttcacta tccaaaataa gatttgccac agcacatgtt 24660 

gccagctctg ccacttctaa aactgaagag ttagctagtg cagctaggga gggtaatgca 24720 

tctctagcaa tagcagccat atccttgttc tctttgatgg aaagaaatat agcagccaag 24780 

cagcgcgagg actccattaa gatactttca gattcaacat fcaagcaactt cattgctgac 24840 

caaagagttt taacagcaat gctgctttca cgcacatctt tcctcgtttc aaaaattccc 24900 

gccagagcag atgcagactt tgcctgagtt tcttccttgg ttgagctcaa tagttttatc 24960 

attgtaacaa ttgcatcact agcagcacta ccttcacgta gaagatcagt aagagcaaca 25020 

actgaaagca tacttctcaa agcatctaaa acataaacct tagattcagg taaatcacta 25080 

gtcaataatg ctgtaagctg actgatagtt gctgtatcag atttatgaat taagtgattc 25140 

aaagtctttg ctgcaatctc cttcccattt gggcttccat tcttcaacag ccacagcaat 25200 

gcaggaacga cttcagcact ttcaacacaa gcacgtatat cctcactgtg atcgcatagg 25260 

ttcctaagga ttgttgctga gtcttccttt gcttttgcag atcctgactc caaaatttga 25320 

acaagaggag gtataccacc agcagcagta atggcccatt tactttcatc attttcatta 25380 
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gataaaaggc aaaggagggc aacagcacat 

agagatatca acagctgaac tccttcacgg 
gtattgcaca atgtcaagag agactttaga 
gttattaaac caacaagcaa acgctttgca 
ggattgctat acaaactagc tagggcttca 
tgaggtttaa attgttcaag taatgtctgc 
gattctgcct tgtcatcata tatcataaga 
gcttgagtag gagaggagca tgattcaagg 
aacccaccag agatgtttgc taaagcacac 
tcaccctgca tgaactcttt ggaaggagct 
ttggaattgg ctatctcttt ccttgcatct 
ccagctgcct cagccctgac aggggcatca 
tttgttgtct ctgctgtcaa caactttgaa 
gcaagtaaaa agcacacatt ggctaaagtg 
aatatgtcca ctcctccagc ttgtattgtg 
agatttttca atgccccagt caataaatta 
tgcagctgct cccatagcac tggaacaact 
tgatctttgg caccaccttg agagacagca 
tcagcagaac tggactttag tagaccaagc 
ttcaccctca gctcattttc tttacaaagt 
acattcaatg accctgaccg gagaagggaa 
ccaacagcac tgaaagcatt ttctctcata 
tactctttct cttgcataga agatgagctc 
cttgctaatg tcccatctgg atcctccatg 
attaggaacg taagatacaa aagtttgcac 
atagaccata tatcaattat caaaccatca 
tgagactatc tgagggggtg agaggggaac 
ggataacaca ataagaggga ttcagaagtc 
taacagcatg gattttagct gagcattata 
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tcttgctgct gttctgatga aagccccaga 25440 

ccttgaagtg cacgccatag actgcattca 25500 

agctcctctt gcacttcatt ggcagccatc 25560 

tcagagtttg taagtttaat tgacaatata 25620 

atggtccgtt cctgcacaag gaatggcaag 25680 

tcaactacca aaggatctga tgcccaggta 25740 

gctgaagcta aagcccccaa tgtgtcagca 25800 

ctttgaccaa gactggagat gacatacgac 25860 

atagcatttt cctgtaacgc ttgagcatac 25920 

atggtagcat taatcaaagc aggaatgcca 25980 

ttgcactggg cagatagaga tttaagagca 26040 

ttaccagggc ctaaaagttt gaggagttgt 26100 

caaacagatg catcctccat catcatacaa 26160 

ctagactgcc cagttgtcag taacttaatt 26220 

gcattccaga atctctcagt gctgctggag 2 6280 

tcaactacat ttccagtctt caggcccttt 26340 

ccttcagttg aaaaaatttt tgatccaaca 2 6400 

aaaatagtct ttgcagcagc tacttgacct 2 6460 

aaaggaggaa tgcaacctcc aagcaagact 26520 

gagcccaaga cagttgctgc ctgtattttc 26580 

acaagtactg gaactgcctg agaatgagat 2 6640 

tctataagct ccaataactg cttcaaagaa 26700 

tgacgcagct gctcaatgca ttgggcaaca 26760 

ctgctgctgt tgctacgttc tctgcaaaca 26820 

attagatttt atgctcaaac tcaaatttcc 26880 

agttagatag ttctcattat cttctatgta 26940 

attttttttc tgccttagaa cttttgacat 27000 

ttagcttagc tgcaaacaga agtactaatg 27060 

agcaattgat ttggtgattt tttaagccat 2712 0 



16 



WO 01/51627 

aaaatattct ttttggacat aattaaaagg 

catcttcaga actgaatgag gaggtagagg 
tctttcctgc aatcaaaaga atcacagtga 
taaatgatga aagtaacaaa agttaacata 
aatatggata aagatcaacc ttgaagttct 
aaggaaaata gccaaaccat atatcaaacc 
attttcaaag tattaaacac acatgattgt 
aaggatagca tcttattgca caaaacagag 
agcattaagt aatgcaacaa agcatttgta 
ctaacccatt aagatagatg gtcatgaatg 
ccttttattt ttcattttct agcattaccc 
taagcataha ccaaatgatg gtagagtgca 
gaatatttct catcaacttc ataatattta 
ggcaacagat atgttccaat gggatataag 
tctttagatt tccatggttc tcatcggtac 
aaaaagtgtt taatacaaat caagggttgt 
aactgccgtt gcatctccct tccactgtct 
ccgaaattta aaaacataaa tgaccagcaa 
tacaggccga gttaaggatt cataacagtc 
acttttcctc caccagtaat gatgatgtga 
aataatggaa aaacatgaag aagtaccata 
gcagcaaacc tccaagtcag tgtagtcgcc 
ccaagcagat caatccagga cactgctccc 
cacatcaaag cctccctgtc cacaccacac 
tgcacagaaa tcgcaaatgc aagttcaaca 
acactgtaaa aaaaaaaatc aagctaacaa 
gacagtcaga gagattggag gtagcagatc 
agcgtgcagc ggggtcaacg aaaaagtcaa 
gaatgctagt gaaaattgaa gaattgaaaa 
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catatcctaa gctgtaacca acctcaatcc 27180 

ctctgaatct tgagccttcc catcaccatt 27240 

gcaagtttaa acggaatgtc tgtacatctg 27300 

aacatagaaa caatatatta gattcggtac 273 60 

ataagaggag aaatggaaaa tgaagttgaa 27420 

ttgacttcat tttgtatcaa gtaattgaag 27480 

tgcttccgtt acagttgact aattatcatt 27540 

gggagatata aaatgaaata aaaactatac 27600 

cattacacfca gtgtagaaca attaccataa 27660 

ctatattttt aagagaatct aataatgcca 27720 

tgtactcaat cctagtgcca aggatatgta 27780 

tgactaaaat aatgcttgct acctaaaata 27840 

taaaaccatg agcatctcaa ttcaaagaat 27900 

aaacagataa cgaaatgaaa tcacagagcc 27960 

tttctagaat tatttgatac aaaaaaaaaa 28020 

tgggctagtt ttagaagtct ccacaaccac 28080 

gcaatttcta aaatcacagc ataacctcga 28140 

gcaatctgaa gcgataatat atccaatgtg 28200 

aaaactaatt gtttatacta attctcaaac 28260 

tgacataaaa tgaataaata aaaaaggaga 28320 

tcattagcag caagggtgct gccattattg 28380 

agcttcgctc cgcgtcactc gaattctcct 28440 

tctccagatc tacactggag agttcgtcag 28500 

ggtaattcag atcagaaacc aaacagcaat 28560 

agttaaaaac tacacagtga cctaaccaaa 2 8620 

tggaatcggt taaaattgaa tcgcgcgaga 28680 

aacagctact accgaggaat gggagtggaa 28740 

agcggttacg acggaatgca ggaatcgaat 28800 

aaaaaacgaa gaggtgagag agaccttgag 2 8860 
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ggagtgagaa gcgaagcagg tttgttggtg 

ggcggtggtg gtagtgggag acagatttga 
agaaaagagt gcgttatgtg atttttcggt 
acaatgagca ccgacaacag ctccagcctg 
ggctaggctc agccttcagg ttcgcttgag 
taaaataatg tacctctttt tacattttta 
tcatcatttt tcttttttaa atttttttct 
tttctcttct aattccaccc caatctattt 
taatcttttg tatgcatatt tttattagtt 
attaataagt attataaata ttatattaat 
ttataagtat tataggtatg cgttatattc 
atatatatat atatatatat atatatgtat 
tgataggaca caaatcattt tttgtgtttt 
ttaaacttta atgtaaaaat taaaaaaaaa 
taccatgtga aaatcactat taatttacat 
actctcttaa aattttcagt gattgaattt 
attataaatt agatggttga ggttattaat 
taccgaatgt catttttgta ttcatgtttt 
atatttttag tattttaaat tttaaatttg 
ggttctttga aagtatatat attttttata 
gtgaagaaat taatgtatac aataaataat 
gttaaccaac tacttcaatt aaagttttaa 
tacgaagaac tcgagaattt tattgaccat 
gtttatgaaa taaaaaatga taccagtgca 
aacgtggtat atatatactg gccattggaa 
gttagtatta agttttgtca aaataaagtt 
gtttttgaca tatacaatat tattatatat 
gtactaacta ttattttctg gcaaatatat 
cgattttaat agaaacatta attattaatt 
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tgagaatgac gcatggattg gtttagcggt 28920 

ttattgatta ttagaggtga ggcttcgtta 2 8980 

tttgaattga cggagaaggg agtgtttgtg 29040 

caagctcata gcctcgccgc gtcatttaac 29100 

ccacccgttt ggaatagaat cacgtggaaa 29160 

tatcatttct cgcatggaca aatttgtatc 29220 

catcagtaca cttatcattt ttattattat 29280 

catttcagag tagataataa tcaatactta 29340 

ttaatacgtg tgcagacatg tacaaaattg 29400 

atcgctgatt aatatttact aacataaata 29460 

atattcatat ttgtatttta ttatatatat 29520 

acatattata gtaaacccac ctcatactaa 29580 

atgtattttt ttaatattaa ttaatttcta 29640 

tgactttttt acaattttta agatttttga 29700 

gtttcataaa ttgtaagttt gttgaaaatc 29760 

tctctttgaa gatttttttt cttctcttaa 29820 

tttggaatac acgtaattgt cctttggaat 29880 

tcaatctcaa atacaaataa atagttgctt 29940 

cgtaaagaaa aatagattct ttgaaatgat 30000 

tggcttgcat aacattgcga attttaactt 30060 

gtttgcctaa gaatattttt tattaaatac 3 012 0 

tgttccaact aagttcgtat atattaacga 30180 

tttttttttt gctgagatta ttgaccatat 30240 

aacattattt tcaccaattt cttacagaaa 30300 

aatagatgac taactgttta tacagagttt 30360 

aatgcaaata ttatttcttt atttacagta 30420 

ttttttagga aagaatatta ttatacttaa 30480 

taactagtgt tatttatttg tgagtacact 30540 

accttttctt tgtacagagt taattaacat 30600 
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ttgatatata cttatatgag attttaattt 

tattttaatt taaaaattga gattagacat 
atagtttttt taatatatat atatatatat 
ttaaatctta tgcatcaatt acatatcagc 
tgcattcgtt attggaagta attaaagatt 
ttttagattt cttttaataa ttttttttag 
gttagatcaa ttcaatcctt ttaatttttt 
atttagttaa aatcaatagc tttgtaattt 
acatgacaac aataatttag taaataaatt 
ataatatata ttagaaaaaa atgttgattt 
attttttttt actagtaaac tgtttactta 
aatatttttt actgaaaaaa ctttacttta 
aaattaaaat gatttgaaat catatataca 
tttttcataa aaaaacacat tatatattgt 
ccaataattc gacttaaaaa aaaaacttca 
cattttagct gagcttgagc tcttgattgt 
ggaaagtcat ctcaattata tcccattttt 
ctctcactct attgcacagt ggagcgtttg 
gtagtaataa aggcatataa gaaaatagtc 
gaagttaaag acaagctacc aagcattcat 
ctggcaagtc atgaggctag attattatgc 
acttgcaccc tgtaagcatt tgctgttatc 
gctagctaaa ccacccatta cttgttttat 
ttaggttgca tattaatgtt aatttccttc 
tgccccttgt tcatttgcac acatccttat 
gaccaagtgg cttttgttat ttcatcccat 
ggctctcctc gttttttagg ctctgtttgt 
gaaattgagt tgggcttgtt tggtataact 
ctaagcggtc aataattatc attataacac 
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gaatttaata tattttatat tcagatattt 3 0660 

tcacatttac aaaagataat cattttcata 3 0720 

atatatatat atatatatta ttaagaaaat 3 0780 

cctgctcttc ccactaataa aaagcatata 30840 

tttgtgattt attttttcca aatttgagaa 30900 

gttaaattaa tttttccctt tatttattta 30960 

aaattggtgt aatattgtcg tttttatccg 31020 

agtcccttag ttttgaaagc attgtaattt 31080 

atttatccaa ctcgaataaa attatcttaa 31140 

actattataa tatttttcac ttgtatttac 31200 

aaagtgattt tatcaccatt tacagataat 31260 

atctcatttt taattaaaga caaatttata 31320 

aaatattgat ctgatccaaa ctcaattgat 31380 

tttgacccaa taattaacaa gaaccaaact 31440 

attattcgtt gcatatgacc acatggtttt 31500 

ggccaagcca tagcctacat ccaaaaaaat 31560 

ttgtgcttag ttggcaaagg tatattttct 31620 

atggcaatgg aatcctttga tttttcaaag 31680 

ttttattata agctgtgaga ggtgggtgca 31740 

gttattacta gcatatatta tatgcaccag 31800 

aataaaatca atgtgggtca tgttattcgc 31860 

atgcaatcat agggaccctg tacagtacca 31920 

tattctagtc ttaaattaaa ctaacagcaa 31980 

gatcgatttg tctaatcccc tatactgtct 32040 

tttttttttt tttcttaagc agatccagat 32100 

ttctcaaaat actgattcaa agtcgagttt 32160 

ttgcgaccac attggtaatt tggtgcatat 32220 

aatgcatgtg agaaataatt tacgaccaac 322 80 

taaattattg ataaacattg tattgggctt 32340 
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tcatttaaga gtgaatatgt gatgaattgg gtcattcatt gtgtagctcg ttgaaaaaat 32400 

ccttacttgc tactagtgag ttaaaagact aacatcattt ttattttcag tttcataccc 32460 

atgtaaaagc tctctagaaa atgggcaatt tactctcaat acaataattg agttctctca 32520 

tgagttgaag agccatgaac ctaataataa atctaggtat tcaggtacac gtttattatc 32580 

ccctctttgt cgtaattaat atattgtaat ttgtaccctt gtgtgatttg gacattctac 32 640 

acacttgctt taaaattcat caattggtat caaaacaact tagatacaaa ttatggctat 32700 

tgtttttcat tgttctttca tcaaaacatt gtcaaaacgt gggttttttt ggcttattgg 32760 

ttatttttct aagattatta aaaaacttgt taatttttca tacattaaat ctataattgt 32820 

ttgagggttt ttggtttatt tgcttattgg tttgtgttcc ttatcatgct cgcgacaaca 32880 

tattaaaagt gtgatttagt aaactaatta taaagaattt gatggttggt tttgcaaata 32940 

tccttccacg aaccctaaac atttcaaacc aaattactca cagatatagt ttttttcaaa 33000 

aattgattga gagatcgtgt agaaaaagta aatttcaagc aaaactcatc ctcaatatcc 33060 

tttttgtatt ggtttataaa catgagagaa ttgggaaaac gcttttgttt tttgatccat 33120 

tctcacatct gcgattttgc ttctttttct ttctttcttc catgtatgtg tttccaacct 33180 

ttcttctgaa attgattggg gattctctat attgatagtt ttcaaacaaa atttctcaag 33240 

ttttactatg tatagggaca ttttatcaaa tccataaccc ataaaatttt attttgttgt 33300 

catttttgtg aaaaatatgt actaaatagt ttatctgatt acatgtctta gtgagtgtct 33360 

gatttcgatt aaaaaacata tgtatttgat atttttcaca acaatttcag actaaagaca 33420 

caaaagccta atatacttag tttagtgtga tgcgcatcac cttaggtgct catacatcca 33480 

aacacttaca cacttgtttt agtaaataat gttatattct atctcaaata aatccaatgt 33540 

ttagaataaa tccttgattg acttattatt attaatatgc tcatttataa attattgtca 33600 

aagataattt tgttgaacaa attttttaaa tatttacaaa atatacgttt cttgcgttgt 33660 

atatgaatca tattgattta tagttggagt ttttagttga tatcacatta catactttta 33720 

gtgtgtgatt ggatagctcc aaaatcatgg ttgatcatga tgaggtaagt caatcttaat 33780 

ttttatcaaa tttacatgtt tagattattt ttaaaattat ggtagaatca attatgatgg 33 840 

aaaaaaaaat catggtagaa ttgaaacaaa taggagtaat ttcaacttct atcaaaatca 33900 

gttatgtaaa atcatcatca atctaaacac acacttatca tgttatcaaa gctaactgat 33960 

acaaaaaaat tataaggtga ttttaaattc tatggcccgc actaattttt ataggatttt 34020 

gctaactaat atccttaaaa cattagttaa aaaactaaaa gaaaaaaaat acttatgatg 34080 
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gagattataa gaaagcataa aaaagtcata 

taattcttta atcttttaaa taatattcta 
agatatatta agtgttatta atattttaca 
aagatattat gcatattaag gactttcttt 
aaatatcaaa tgttaactcc gaaaatatca 
attattcaat ccgatgtttg tcctctctgt 
cgacctacat ggacctacga cctgccctac 
ttaagaggtt aggttcaagc ttttttaaaa 
gcttgttaaa aagccttata agtctgatag 
atatgtatat atatatatat atgtatatat 
cttatattat tttttgggta caattaattt 
attactgctc cataacttcc attcctataa 
tgaattatgt acccctttat attacccatg 
ttttccatta actttcctat tacgttccta 
caggccaggc taggttttta aaaaggtcag 
tataggccag attcaggcct caaaaattaa 
tggtctagcc tggcctattc ccacccctaa 
agccacgcgg attgttattg tgagtggcaa 
attacttaat gcaatcccat aattttaaat 
ttctaccttc tttctcattt ctccttcgtt 
gttcgtatac attttaggca aagggagtgt 
tttgcatata ttttccaaat agctgaattc 
caaaaaaaca tttccaaaat catgttaaca 
tttttctgga aaaggtaacc atgaattatg 
agtgtttttt ttttgttttt ttgtatctaa 
tttttttgga ttacatattt tggagaatgc 
ccaacaacat gccattgcta tcttcgtcag 
tgaggaaggt ctcggctccc aaaggaaaag 
tttgaaccat tcaaattgtt ttaaaattcc 
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aatgataatt ttgtacatct caataatttt 34140 

agaacactag ttaacatttg atatttttat 34200 

tgtggtatgc ttaggcctag taagcccttc 34260 

tcaaaagaaa agtaataatt atatctataa 34320 

aatgcttagg cctagtaagc ccttcaagat 343 80 

atggtaacta ggggtgggaa taggccagac 34440 

ataaagtctg gtctgaactg acccatttaa 34500 

agcttattaa attaaataga tcagacttag 34560 

gtcggcctat atatatatat atatatatgt 34620 

atttatatta tttgttgata ccaatttata 34680 

ttttttgaaa ccagcagact ttgattatac 34740 

tcaagtaaga ctttaattac catttaggtg 34800 

atttttattg gcttttttgt taacgtttcc 34860 

cgtcaaagca aacaaatata aaaaggctat 34920 

gccgagccaa aataaaagct tttgataggc 34980 

tcgtaggcta ggctcaggtt ttttaaagtt 35040 

tggcaactca ttgagagcgt aattgggcca 3 5100 

tttgggggca attgcttcct acacctctaa 3 5160 

atatccaaat tgaccttatt ataacttcat 35220 

gacgcattat tgaggtgtgc agaaggagtg 35280 

gaagtgggta ttttaactat ttttttccaa 3 5340 

ctaagataca aaaagaaaaa aaaccacttc 3 5400 

ttattcctaa attgaatttt tggaatgcat 3 5460 

ttagcataat ttcataaatg cttttttgga 3 552 0 

caagctcaac tatctgaaag gtattagaga 3 5580 

ttaaaatcaa caaccatacc agtcaattca 35640 

gtgcaaaaag aaaataaaga agaaaggggt 35700 

gaatgaggta taagagacac tcaagggcat 35760 

tatgggatgc aggaagcaaa ttaaggggtc 35820 
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taggaagtaa ttgtccaatt tggttttcca 

ataaatagat gcactagcta aattttgttt 
ttttgtttct catttttcat ccttaccata 
cttttagttt gggctaaggc cctgtcatga 
agtggtccca acctctctat ggaaaagtat 
tctcctacca tctcatccac tcccccccca 
gaatccctat cccatttcct ccatgccgca 
acaaaagtcc ctactcgtgt atgatctcca 
ccatcataac tatgccgcca tgaacattca 
cacatttttg catggatatt tggctgcata 
tgttgcaaaa tataacaatg aaatcafcgat 
tacgataaaa ttgtgattct attgtgttag 
acaatttcat tatacatttt attcaataaa 
aataaaaaaa cttcagcccc ttgataggag 
acttccctcc tgtgttgggt attagaactt 
cccagctcag gcacggggag atgaaattct 
ttatcggaat cacaacgaga aaatcttaat 
ttggtattac ttcagggttc cctcacggaa 
agaaaaagaa ccctcaacca aacaacaaca 
tggagaagca atttctcact cctgaaaatc 
ataaaacatt ttgctgatac tgaattttat 
tttttttttc tgcttcgttg ttttcaggga 
attcggtttg gttcttcgcc tcgggactcg 
ctggcgcctc tttttctatt ctcaaattag 
ctccgcaaac tattatcatt gtgagtaatt 
atatatgaat gtatttttat atataagtta 
gatcttgccg ttatcaattt gatgtccagt 
gtttaaattt taataaatag tttgtatgtg 
cagtgtacag tgattttagg atgaacactg 
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acaaaatcat gcttatggtg aatccactca 3 5880 

gtctcaatgt gttccttttc atctctttca 35940 

tgtttgctaa agtgtctctt tttctagaaa 36000 

ataaaaatag aatgctcttg gggtgttgct 3 6060 

attttgcccc tccttcttcc tacttccttt 36120 

cctattatca cctaatcccg accagatcta 3 6180 

caccactacc agccttacca tcactattgc 3 6240 

cttttgaggc tccatgggcg caactcctcc 36300 

cagttgtgca ctcctttgtc ataatggaat 3 63 60 

aaatcgtgat ttcgttggat taagtggggg 36420 

ttgttgttat atttagcgtg gaaatttttt 36480 

gggggttgaa aactaataaa gaaaagaatc 3 6540 

atcacaattt catataaagg atattcttgg 36600 

tcaagaagca attttaatag ggcaatagta 3 6660 

ttatatgtcg gcccaaataa gaaacagaag 36720 

cgattatctt tcagccacgt gttaggggca 36780 

gttagcaacc gctgtggaac cgagacccac 3 6840 

cggaacaaaa ggcgcacaca atcgccaaag 36900 

acaacaaaaa aaaagcccta gaaaagattt 3 6960 

tcatcctctt cttctttaca tttgttttgg 37020 

tttctcttgt tttcgtttct tttcgacctt 37080 

gaaagggata ggagtaataa aagagagtgc 37140 

aatcttcgat gcctctctgt tttttcttct 372 00 

ggtttccgtg agcgttcctt tttctcgatt 37260 

ctccgttctc tttctttcgc ttagtaattc 37320 

ttttttgttt gaaatttcat ggtttcctta 37380 

tctctttctg caccaaattc tttcagtgtg 37440 

ttagtatttt tttttcatcc gcacccagtt 37500 

tgattgttgg aattgattgt tgtttgtttc 37560 
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ttggtttttg tgaatttgat tgttggcttc 

taatgtcgtc aaaatatcag attcttgaca 
agctaaaaga tgagctgaag aggaggaagt 
tcaagcgttt ggatgaagtc cttcgtcttg 
atgaggccaa tggtttcgat ggtcatgtag 
tggatgctga aatggttgat tcaactgaca 
aaaagggcaa gagcggtgta gttgatcctg 
aggttgtgga tcacgacagc aataagaatg 
acattaataa tagtgtgtca gccatagatc 
gtggagattc tgcaaatgtg ggacaggagg 
ctaccattac agttaatgag agtgtagtaa 
atcgtgcaga aaagaataat gaggattcag 
cgcagctgga tggtgaggac tcaaagcccc 
aggatcctgt gcccgactct tctgttccag 
tagggtctca agtaaaatct gattctattt 
atgaactaaa ggatactata attactgata 
cagagatggt ggaagaacca tcatccagaa 
attcaatgga tgttggaggg ctgcatgaga 
atgttttaag tccagacatg aacaagacca 
agttgaactt ggacagaagt tctggtgatg 
agcaaattga ttctaagttt aatgttgatg 
ctattgtgaa ggaggaaagt agtactatag 
gtgatatcca ccaagatatt gacattagtc 
atggtaagca atatattgtc tgaaattgac 
gggcaagaaa gaattatagg cactatatgt 
ctttgtattt tttaccatcc agtattgttt 
tgggaacaat gagcctgcaa aaaggcaacg 
agatgcacaa agcactactc ccaggcctgc 
gaaacgcaac ttctctaggt ctgattcctc 
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aggttaggga aggtgatttg cgtttggaca 37620 

accgccctat taatcagtgg aaggttacag 37680 

taagtatcaa gggtttgaag gatgatttgg 37740 

aaagggaggc tgacgaggct tctgagaagg 37800 

atggggaaaa agattcagag gcggttactg 37860 

gggacaatgc caaaacattt gagactgctg 37920 

ttgaaataga aaatgtggaa aagattccgg 37980 

acaagcagga tggtgttacc aaccaagttg 3 8040 

aagaggttga acctaagggc ttgccagctg 3 8100 

caattgccca tggttctact gtggagaaga 38160 

cagaagtagt ggccagtgct gaggattctt 38220 

cagcaaagct ggagaatgag gagtcaaagg 38280 

agctggattg tgacatcaag cccttgcatg 38340 

aaaaccaggt atctgaggtc aaccctagtt 3 8400 

ctactgattc tctgtcaatt aatcaaaaga 3 8460 

atgtcaaatt agaacaagat attgttaagc 3 8520 

atgatgtacc tgtttcttat gatgaatcac 38580 

aaaaggcatc tgttgaagaa aatttcaata 3 8640 

atagcagcga tgatgtgggg tatcccgaag 38700 

attctatgga agaggatttg cctgagacta 38760 

aacttaaaga caaaatagag attgaagagc 38820 

ctgttggaga tggtctgtct gcaggagaaa 3 8880 

cagttgctcc aacagagaaa cgaaaattta 38940 

actttttttt ttattggttg tactcctgaa 39000 

tgctttactg gcatgctgtt attatttctt 39060 

ggaacatttt gtcacagaac aagcatcagt 39120 

caggtggaac actgaaacag ttaaaggtcc 39180 

tactacacct agagatgagc caattgcttt 39240 

tgcaactgat gatacaccta aagaacgcat 39300 
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tggtaagtca acattgttgt gattagggta 

tgtgtaatgg tggtgttaat ggagatcaaa 
ataaattcct ggcaccttca gattttaatc 
tatcttatcc tctaaaatgt atttgtggag 
tggttgattt tttcatgcac ttgtgcattt 
ggttgtcatg cgctttttta catgggtgta 
atggattact ttgttgtcat agtggcatgt 
aaatagtctt tttttaatct ttggcagttc 
tcaggattga tcgttttctc cgtccattta 
agactgggaa tgtcagcagc ttctggatgg 
taagataact cttattgctg tcttccttca 
tatgacacgg ttctgacatc taagtaatct 
tctgtggatg aagccattga gacacggaat 
ggtgggcgtc tcttagttgc tgagtatgtt 
cctcctccta ctcaggctgc atctgtcagc 
tcacagccag agccttcccc tcgtctgcac 
ccacctccac caccattgtc aaaaccccca 
ccaccccttc ctgagaaagt tgacccaccc 
accacagcca ctcctcggat ctactatcta 
cttacagcac agggtaaaag cgcgaggcag 
cccttgcagc ctttactgat ttggatattt 
cttggcagat ctggtggtgt tttgtctcgg 
tatttagttt tagaagtcgt gaactaagag 
tttgaactag gcttgagcga acctttacct 
tatgtttata ttcctagcca cgaattttgt 
tattttctta tgctgagtgt cgattgattt 
tctagccacg tggtttgttt gacatcaagt 
gttgactcgc caatttatcc accgtttggg 
ggtacagtat ggcatggatg tttattgtga 
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aatctattaa gaactaatgc tcctgtttgt 39360 

tttcaattcc tagctttgtg gctcaaataa 39420 

ctggacactt cttgaagcac aagttgcata 39480 

gtttttattt taagatggac aactacctgc 39540 

actgagataa gacttagttg ttgacttgtt 39600 

ttttgtctat gttgtacagc atgttttttt 39660 

gttgtattgt ctgtctctgt ttttcaagtt 39720 

caccaccaca aaggtcccca actaattccc 39780 

ccctaaaagc agtgcaagaa cttcttggta 39840 

accaaataaa gacccattgc tatgtgactg 39900 

actgtaaatt ttcatgtgat gttgttatat 39960 

tttgttgatt tcttggtgaa tcagtactcg 40020 

gctgtatata atttgcaatg gccaccaaat 40080 

gatcctgaag aagtgaaaat gaagttagaa 40140 

actgtcccag cagttcctcc tgcacctccc 402 00 

agggagccgc atccagttcc agctactctc 402 60 

ccagtagcaa gagaacggct tccatctcca 40320 

attgtcactc tggatgatct ctttcgcaaa 40380 

cctttgtctg aagagcaagt tgcctcaaaa 40440 

tagataggcc attagcctgc aaatcttctt 40500 

taattagtgg tatgggatta ttttgaatgt 40560 

gaagaaattg cattgcttga gacttaaatt 40620 

ctaccggggt tggatagatt ttccccactg 40680 

aaaattttat ggaaaaattt gacgttagaa 40740 

tttcggtagc tggctttttc ttgatccgtg 40800 

tagtctgttg attgagcata gtagatttct 40860 

ctagcaccgt aggaaggttg atttgtatgg 40920 

agaatgtatc gcaagtttgt tgttattttg 40980 

tttggttttg atatgactta tataagtttt 41040 
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tcatccataa aaaatatggt gttttcaaat tactacttaa tcacgtcact gaaatgtgac 41100 

taatggtata actfcgaaaca aaaaattttg aggtaattat ttgataaaaa attttttttt 41160 

agataataat ttaaaaataa tttatttttg aggtatgaaa aatttattta aaccttttga 41220 

ttttggttgt tattgtgaaa atctaactaa attgcatatt aaatttatat taattttctt 41280 

ttacattctt atttaaatta tcttttctat aacttaagct ggaactcatt ttttataatg 41340 

atagtatgtg gttttttttg tattgaaatc ttagttatat tgcattattt atatataata 41400 

ttgatactgc ataattagtt aactaatgat ttgtaatgta tttttttatg aagtttataa 41460 

taatttaatt gttgtcaaaa accacttaag ctgaatgaaa actaaacggt gaaaactaaa 41520 

aaaaaaaaga acaacacggt gaacatccct actactaagg acgttgtatt tgtttgttga 41580 

gagttgtggg cctgtggctc cctaggtaaa tcagtaaaga aagactaact attcatttcc 41640 

ctccttaaaa gtgttaaaag tccttcaaag atagaaaatt aatttttagt ctctcaatgt 41700 

ttaaaaagtg taataaaatc attctataac tttttctgtt aactatcacg attttaccta 417 60 

acacttttcg attaaattgt gataaagaat gtacatatgt gttgtttggt tagcaaaatg 41820 

agtaagagac ttatttgtca gccaaaagtg gatggattat tttgtctctc tcggagattt 41880 

cttctatttt ttctcccccc atttctttat cgttacaatc ccagcttccc ttccaatttg 41940 

ttttcacgca gttcatcttt ttcacctgga tgctagcaaa tagatctggt tgacccgtga 42000 

actgaaccgc acgctgcaac catttttggc gaccaaatgg agtgggaagg gatgcatttt 42060 

aatcatcttg tgaagggggg aaaacccata accaattttg caaaaggttt gataaaaaaa 42120 

ggaaaaaaaa aaacacaaag aacaaacatc atttgcacaa caatacgctg gatcactttt 42180 

ttttggcaaa ttctttcatt tttgaacaaa taatcaagtt ctaatatatg acttgctaag 42240 

aattggttta tctacaaaag cccttgattg atttaaaaaa atcattgagc aatgagcgag 42300 

aaatttgaag aaaatgaaag aggaacacgt ttggtttgag caacaatttc ataaatgagt 423 60 

gttttttgca attactcaag tttaagtatg ggttttgctt gaaattgatt gaactacaag 42420 

aacactcaat caattttaca aaaaagaatt caatggagca aaatcaagcg caaaaaggga 42480 

aaaaaaaatc ccttttaacc ttatatgtta ggagtatgct tttgggatcc aatttttaac 42540 

ctgtttcaac cttaattgca ataagcccat tggtgagaga agaagatata gggaagagga 42600 

gaaaagagat gatgggaggg aagttgggat agcaacagta aaaaaatggg aaaagaaaaa 42660 

aatacaagaa atcttcaagg gtaacaaaat agttcaactt ttgggtgata aaaagttcat 42720 

cctactcatt cccccgacta agcaacacac atgtacattc ctatttggtc aaaagatgtc 42780 
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gtgtatacaa aattgtgaga gttaaaagta aaatcatttt gtcacacttt ttcaatatta 42840 

agggactaaa aaataatttt tcatctttca ataattgaaa tgtcagaatt tgacactttc 42900 

aaagactgaa atgagtattt atgataaaaa aaaactaatg tcacccattt agaatacaaa 42960 

tttgagcatg aacatgaata ttcttaataa aatattgttg caaagtgtgt agattgaatt 43020 

tgagaacatc acaggtttac accacaagaa aggatcaggg aatttttagg aaaaaattct 43080 

tgagtaacca aaaaatgaaa aaccaatagg tgtagggata gtttgggttc caacaataga 43140 

gtgtcaaaac ctgatagttg ctcaagtcaa tgaatcttta aagcatctcc aataactttt 43200 

cttacaatct aatctttcat gcaagatctg atcttgttaa gatttattac tagagtaaaa 43260 

ggattgattt ccaccgtgga gatctgttct tctacaagaa taagcaagat ctgcatgtgt 43320 

tttcaaaaaa tgatattata atgtctctct tcataaaaag tgtaaataaa tattagaagg 43380 

tgaattcatt gtaggaccta caaaaagtgg tttaaaatca caaagcttga ttttctattg 43440 

aagcaaaaaa ttggaaagat ctctagaatg atatagtatt gcaaggtgca ccaaaaagtt 43500 

gacttagatc acaaagtttg atcttctatt aaaaatgtct ttagcgtgtg tttggattat 43560 

aatttacaaa gttaagttta aggtaacata actttttaac ttaagtttat agataaaagt 43 62 0 

aagttgaaag taacttgttt tatgtttgga tactttgttg taaaagtact tttgacagaa 43680 

tgaaacttat ttggattatt tggatgaaaa gtacatttga tattgttgca aaggaacaaa 43740 

tggatgttga ggggttataa taaaggtcag atatggcttc tttaaatttt ggcaggtaag 43800 

gaataatgaa aataattatg ggtttaatgc tggttgttta gaggtaattg ctggaaggag 43860 

ttattgaaat ttttaaaaaa aaccttatct ttctaacgga tccaacgaac gttaacggaa 43920 

aagtaaccta tagttgcttc ccattaaaag tcggatcatg tgcaattcaa gtttagttga 43980 

accctcgttt gtaatggtct cacagtaagt tcagtgcaac ttctactccc gtttagagac 44040 

tcaaacgaaa aaacaaacac aactttattt tttttttcat tacgtgcgtt tgtacaacta 44100 

ctatgaaaca aacatgaaaa atgcacaatt tactcgtaga aataataata ataataaaaa 44160 

taaataaata aataaaataa agtatttttt gaaaaataaa agattgttag tactagttac 44220 

tacggagtat cgtattcgta tcatccaccc tgcggcgttg aaacaatttt catggttttg 44280 

cgccaaataa aataggcgaa gaaccctaac ttaaaaaacc tcttctcttc aagtaacgcg 44340 

tctttctttc ttcttcaact aatttgcatt aaaagtatcc tcaagtaact aagaaagcaa 44400 

tcaagaaaga aagcttcttg tattcattca ttcattcatt atgtgtgtgt gtgcgagtac 44460 

aaaatttatt tctggaatct gaggtgtttc atatagccat gttgcttact tgctttagta 44520 
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tgttaattcg gttaacaatt ttttgggtaa gtagcatagc tttgttttac tattctttta 44580 

gcatattagt tcttcgggtc aatggaagac ccattacacc cttcaacaga aaattgacac 44640 

accgagatgg tggaacagca agtaaatgta aacttatgtt ataagaataa gccactgaag 44700 

ctagataatc agccaccaca ttagcttctc tgaaagcacg aatttattaa ttgttaacag 44760 

aatacaatga cattatttga tccatgtagc tggtcttacc tagtggaaaa ggcttctgtt 44820 

tgtttttgtt gagtgtatta gtcattaatg caacgtaagt tgttgtcttt aacgtttgat 44880 

atagctttat cacattggtg tatatttcac cggagcaaag aagaactggt tgttgccact 44940 

tggaaaaaac aatttgacaa atcagagatg attcaaaggg tcctccttct gtatcttgca 45000 

aatgacattc tgcagaactg taagcgtaaa gggaatgaat ttgtgacaga gttttggaag 45060 

gttcttcctg cagcacttaa agatgttatc aagaaaggtg atgatcatgg aaagcgtgtg 45120 

gtatctacat tggtaagccc tttttacact atgtgatgca agtatttcaa ctgaatactt 45180 

aaatgggaaa gggtgtaaaa gaattgaata taaaatcata gcagatagtt cttcatgcat 45240 

ggctgaaact ctagatatgg tgtataaatg ctgatcatat gatttgttta ccatgatatt 453 00 

gatctgttaa cttagttctt tactaacttt agattgaaat atgggagcaa aggagagtgt 45360 

ttggatccca ggctaggaac cttaaagatt tgatgcttgg agaagatgca cctccaccat 45420 

tggaatttgg caaaaagcga tcacgttctg taagaattgc gaaaagggat tctcactcca 45480 

tcaaatcggt gagactgagt tactgagggc ctttgcaact acatccttcc tctccccatt 45540 

tcatacttaa atcattgatg ttagatagtt acatccttca tagaggtttg ctatggttct 45600 

ttcatatctc acttccttga tattgtcttg atgccagaaa ctgtccatag gaggtacagc 45660 

agaaaaaata gtgtctgcat ttcatttggt gcfccagtgag caatccgctg aagatgcaga 4572 0 

gatgagtaag tgcaaatctt ctgttcagcg tgtgaggaag ttggaaaaag atgttgatac 457 80 

cgcgtgctct gttggtaagc atacagcaga acagagttac agaagtaaca ccttaattat 45840 

ggaagttaga gttctaatat tgtttctttt ttcagacaaa gatcctaaga gaaatacttt 45900 

agcaaaggaa ctagaggagg aggaaaatgt tttgaaacaa tgcatagaaa acttaaatta 45960 

gttgaagcaa gcagagtagc ccttgtatct cagttaaaag aagctttgca tgagcaggta 46020 

agttatacac ttgtattcaa tattttcagt attttattga gttttcggtt gttcctaact 46080 

gtgtctttca attgtttgtg gatttttgta ttaggaatct gaactagaga atgttcgtac 46140 

acagatgcag gtacgtactt ttcatgattc ttttccacag aaggttgtca tattttgttc 46200 

atttatcttg gtttgttgcc atttaattac ttctctaatg gtacctcttc tgcatatatc 46260 
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cctaggaagt aattctgaat attctccttg tagtttattt tcatgcatat gaagagttaa 46320 

ttttcatgaa ggtgttcatg taggtttaat tatttctctt atggtacctc ttctgcatat 463 80 

aaagagttgt gtctggaaga tgtttaaata ggaggagatg ctttgcctga tgaaatttaa 46440 

atggggaaga tcagtgtaat ttctgaaaca agatagggtg tttgcgttaa gtttcataaa 46500 

ccttgacgga ggttagtgta ataatcttga tttatgaaga ataaacagtt ttggttaaag 46560 

aattgcttag tgaaagcacc gtacacaatt tgagagtaaa aaatctcttg ttaatatttt 46620 

aatgccgcgt attacttgtt gagtctgttt tcattattgg catttggaca tgtttgcggc 46680 

tttttagcta atcgcaacga caaatttgaa ctttattatg cttaaatgtt tgtttaagga 46740 

tatttaggtt gcacaggcac aagtagaaga ggcttgcaac atgcggaaga tacttgacaa 46800 

tgaagattcc tctcagaaaa catccactgg aaagtccgct gcagtaattg ctgccgaggt 46860 

tgcagataag cttgcatgtt cctgcgcaaa aagattggtg atcgtaaaaa ccattcaaaa 46920 

ttaagatcgt gtagtgttgg tttgagttgt ctaaaatata atgatgtgaa agttttttaa 46980 

aaaatcgatg gaaacattga cttggttcag aattttcttt tctatttgag agccaaacta 47040 

aatacgacta atttaatttg aaataaaaag aatcaattaa tcaaattatt tgatgaaccc 47100 

ctctataaag aataactttc tgtgattatt actatttaag tcataagatg acggggggga 47160 

taaaaagtac tcgcagtatt ttataaatgg gctgaaactc caacatcaac tagactgtcg 47220 

atacagtttc atactaaact aaatctgtac atatattgtc atcgttatcc agtttgagaa 47280 

ttgaaggatt gcaatttgca aagaggtcat aatccaccct cgaagtttcg gaaagaggaa 47340 

gcaagcaata agcagaactg cacaactctg cattgtgcat tactgttttg gaacagtggc 47400 

aggaactgta ccagatttgt gcttttaata actgaagggg acatgaggct gtctgtctct 47460 

gtcacgctcc aaattaggat tcaagttaag aagtttttga ggatgtaatt caattccatt 47520 

cctatcataa gagccgtgta agtcaaactt atgaggtggg caaccataca aacatgaaaa 47580 

aacacattta taatgttgag aaaaactaac aaatgcagaa aaaaagtata gctggtctat 47640 

gttcaaagaa tgggaaagtt tcagacacat tctcaagaaa ctatgtcctc agtttggtca 47700 

agttttattc aaaaactttg ttcactaaaa tatgataaaa tggaactaaa tagaaaatat 47760 

ctcttctgtt gctaaaggta aagtaaatgc tttataaggt gtccatgtgg aaagatagat 47820 

gcaccaccta actatagctc agcctttcat tgctttcttc aaaacaaaat ctctaacaga 47880 

gatcggtaaa tgatacatga tagccatgac agtagagtaa tgcccatagg tgaaccatgc 47940 

aggtggtttc tccttaagga cagcagctac agtgcttata gcaaactcat ctgtaggggt 48000 



28 



WO 01/51627 

tgttttggac ttctgcgaaa aataagctct 

taatttccat tctggcatgc ggttgtagct 
gatagctccg gggacaatat tgacaacatc 
taccacatgc tcgtgttaaa gatgagaaca 
ggccagccac cgaatcccta attgttgcat 
catatataga gttttacaaa ctccactata 
ctggaccaac caaagcctaa aagcttgttc 
ggtggatttc tgtagccaat tactgttgcc 
aggtgtagag taacgcaagt ttgaagaaga 
aagagcagct ttagaagctg tataagctcc 
actaccaata ttgacaatct tccccttttt 
aaccattctc aaggaacctg catgcgtatg 
gcaattcatg gttttgtaca ccaaggcaat 
gaaaatccaa tgtacattct tgaatttcaa 
atgcttgaat gaaagttaaa aatattgtgt 
tagaaacaaa agcaaagatc ctgacaaaat 
aaaaatcaaa cattatagtt tgaggagtaa 
aattcagttt gcgcataacc tttgaggcca 
cagcaactta aatagagtgg ctggttccaa 
gtagtgaaca gaagccatct gcagaatgat 
taatctaatc caacatggca gattaacaac 
ccaattaaga taaagaatac tgaccgaaga 
agagaggagc ctcggcgagt gggcccacac 
cgatgcgacc gtacttgtcg acaacagcat 
gaacatccaa ctcttccaaa aagaaccttt 
gcgacctgct ggtggccacc accctgcact 
ggcctatccc tcccgtggaa catcctgtga 
catcgtggcc gcgcgtgtcc atcgtagaat 
agtgtcaact tgttgctatg ccaacaatga 
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atctcggatt gctgcttcaa aaggcttgaa 48060 

ggctatggca gaatctccaa tgtttgattt 4812 0 

aattccaaaa tgtccaagtt ccagtctgta 48180 

attagataaa atgtaaataa ttggcacaaa 48240 

ttgcatccac tagattttga ctttgaagtg 48300 

ttgcagtttc catgaaatca acacattgat 48360 

ataccaaacc aacaatttct atagaaacaa 48420 

tctagttgat ttgatgcaga aatgaagcac 48480 

aaaatacctt aatgtatctg tcaaagcatg 48540 

tgaccaaggt ccagaggcca aggcagcaac 48600 

cttagttgcc atatgaggaa caaccgcctg 48660 

cccgatgaaa catagcatca agtttaacaa 48720 

ttttggataa aaattgtgaa tgtgattttg 48780 

cacacaaaat aaagaaataa gaatttgtgc 48840 

gtgcgtttca aaaaacagat aaaaataaaa 489 00 

tacttttttg tctcaaataa tttttaacct 48960 

aagtactttt tgagttccaa aacatgtact 4902 0 

caattgacaa ataagaatga aaacaggcaa 49080 

aatgcattat ttatctagct aggagtaact 49140 

taaaatcaat ctaaaacaaa ctcatttggt 492 00 

aaatcccatc caacaaatag catcaaaatt 492 60 

cattggtgtc gaaagtgttt tggatggcag 49320 

actgaacacc ggcgttgtta acgagcacgt 493 80 

caaccacctt acgcacgctc tcatcggact 49440 

ggtcgtgttc cagctccgcc atgctcgacc 49500 

tcttctcagc gaaggcgcgt gcaagcgcgt 49560 

tcagaaccac aggtttcggt ggctcctcct 49620 

acaaagagag gcagaggagg tgattagatt 49680 

caacccaacc caacccatct agagtacaaa 49740 
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acaaaggcaa gacttttatg ccagcttttc 
atgacaagca gctacaacca attttctctg 
catttatgac aactattaat taataacatt 
aaactttatt tttgagttta tattagatga 
gtctttttta tttttatttt tggatacaca 
aactcgggtg gaacggtacc tcgagtgcac 
cttatattct cttttaactg aaatagtagc 
aatagagaaa ataagactca gaagacgtca 
ttattcaagt ttgtaaaata gattttcaat 
gttaagaccg taatttgggt agattatgtt 
aaggcaaact tgactacatt actactatag 
gtttcaaaac aaaattaaca caatctcctg 
agcaatctac tcagcagcat cttttttctt 
ggcatggtcc ctctcaaatg gttcaagggt 
cttcaacttg ttcacttcgc tttcaaacac 
gttagcctaa acatacaaat gaattcaagt 
gtaatacata agagaagaaa ctaaaccact 
tccaatgttt tcaatagagg gcaagcctcg 
tgaaagtccc aagttcaaat tctaaaaaca 
actctcctta gacctcacta ggtggggacc 
tttaccatat tgattactca ataaagtttt 
cttgattgaa ataacaaaat gacctcctgc 
aatccttgcc tgcaatggtc accaaaaatc 
ggtgtttcaa aaatagtggt ttgataatgt 
taattcattt ccagaaaatg taaaaaggaa 
aaccacagcc acactttcct cttaaaccag 
cacactctct ggaccataat ccaattgagg 
tacctgtaac accttgctaa tttgcagccc 
acagtgtata aacaaaatcc tttctctttg 
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ctatatctag agtagtactc ctacaatcaa 49800 

tcattcattt attacggctt atacatttat 49860 

atttttcatc gtcaaaaata agaaaaagaa 49920 

taatatttat gtttttttta gaaaaaaaat 49980 

tactatatcc attcccttac ccacatctaa 50040 

gttggtttgt catgcttatg gacaactaat 50100 

ttaagaaata aaatactact ttttttggta 50160 

cacttaaaaa tcttcttaac aggtttcaat 50220 

acttcagaaa tgtttgtctc atctcgatcc 50280 

atttctttac tgtaagcacg gtaaatattg 50340 

aaaaggtaca aagtatgttt gataaccgag 50400 

aaatgacatt aagcctactg gctaccggaa 50460 

cttgggcatt ctgtatccac caaccacaca 50520 

gacttgctcg aatggtttga attgatctgc 50580 

tgcctcggca gggactgtgg agtctatgca 50640 

gagaacaatt tagcagatag ttaagacata 50700 

taattcaagg taatcataaa catttctctt 507 60 

acacctccat atgtttgttg tttggaggcc 50820 

gccttagagg gtaaaagcct gattccttct 50880 

atgtgcactg ggtcaccctt ttttaaaatg 50940 

ataaaaacat gattagatgt ctgaacatac 51000 

tttcagataa tatgaagcat tcagccccaa 51060 

ataaacgaaa ctgatacata gaatataact 51120 

tttaaggagc ttttatcttc aaatagaata 51180 

tccacaaata taaaacctat gcaccactcc 51240 

ctgcagatat aaactatcat aacacatcat 513 00 

ctgcttataa tatctgacgg cccaccccaa 51360 

ctagaccagc cgggggattt aaaaacaaag 51420 

ccttcatcca ctccaagaat tgaagttaac 51480 
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catctccact atatcacatt ctcatacata 

caaaatgggg ccataaacag gctaagaaaa 
attaaagcaa taaaataata gcaaaaagga 
tctgcatcat ccatgatgtc ttgaagggga 
ttccaattta acagaataca tgcaagtaaa 
tttgagaaac taaaagaaaa acaacaaaat 
tcattttgaa aatcgaaagt tatacacaaa 
acgtgtagat ggctggctaa tatttggcaa 
aacaaacaat aatcatatgg cactgccgat 
tgtattgtct gtaaaaacga gactgaaacc 
ccaaacctga tcgggctgag caacatcaga 
cctatactta gctggatgtc tagcatcttc 
tgccatatta accaggtcac gtccacttct 
agtcttcata caagaacaat aacacattat 
ttatgcaatt ctaaattcta aattttactt 
acagtggttc cagaagcagc tcctaggtaa 
cacaagacaa ataatttaaa aaaatgaagt 
acatgttgta aagacacaat cttacaatcc 
cagccaactt ggatcgaaaa ggattccaaa 
tctgcaaaat gaaactatca attcatgcat 
aataatggtg agaaagaaaa gaaacctgaa 
caccaggaac aagattctta gtaacaagag 
catcatgtct atggggctga accacaacct 
caccacggcc accacccctg cctccgccac 
ctttgaaggg agtgcccctg tctcctcctc 
cacccctacc accacccctg aacccaccgc 
ttcaaacact tgtcaacaac aaaaaaaaaa 
aaagaccata tcatctagtt caaatgccaa 
ccacattttc ttattaaaag gaaaattacc 
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aaattgaaaa aaaaaaagta cttatagatt 51540 

gataatccag cactgctgtc aatttaaaaa 51600 

aaagaaaatt atgcgcatca gaaactcggg 51660 

atccgagtaa ctcatcatat gcacagatta 51720 

aaaataatat tgaaaattgt aaaatacaaa 51780 

gaactctaaa ctgagatttt aaaaattctt 51840 

tgtaaaagag aaaatgcaac aaaaatatac 51900 

ccaaacgaaa tagtagatga attgaaatta 51960 

taagtacatc agtattatgg aaaggtagta 52020 

aaactcgaga tatgttgtaa gaaattaaaa 52080 

aaatatgaca tcaaccattc cgaccagcat 52140 

aataatggga ataacattag ttcgcttctt 52200 

atgggaaaat tctactgcat agacaactcc 52260 

aatcagcact tctattccac aaatctaaga 52320 . 

acagggccaa caacgtctga tacatgagaa 523 80 

aggactctgg ctccaggttt cttcataaga 52440 

gtgatttgaa gaacttcgcc taaacatagc 52500 

atatgttgtc aactccacca aggatggcag 52560 

ctctgtactc atctttggaa ccatcttcgt 52620 

ccatggttca gtacttaaga agaaaaaaca 52680 

cagtaatcct tttctcattg taaacagctt 52740 

cgtcttcttt acccttggca atgaaaatac 52800 

tgctacctcc cttcattcca cctctgccac 52860 

ggccaccacc acctctgcca cctcctctag 52920 

ttcctccacc gccccttcct ctgcctctgt 52980 

caccaccaaa accaccacga cctgaacaaa 53 040 

aaactattac cacaacattc ttacgtcacc 53100 

atgccaaatg caaaaaccaa aatcatcaaa 53160 

tcttggagga gccattttcg ggagctgtaa 53220 
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actgaagtgc aacgcagcag caacaactgg ttaagaggat tttatagcca gccgaacaac 53280 

tgaaatagtt agggtttaat agagggtagg gttttgattt tatagaaaaa gaaaaagaaa 53340 

gtgtcttgca tcatccacta acctaaccct agcggtcggc ccattttcct taccctgctc 53400 

acttgttcgt gggcttttat ttgggttcat cagtcttcct aacaagcatt tattgctttg 53460 

ggccactagc aaggatttct ctttcctttc tcagctctct acggattaat taagcttttg 53520 

atattttcaa caaaaaataa ttaattaaaa tcaatgaact cataatafctt ttttaaaaga 53580 

actaaggagg gtatcaagaa tatgatgttc ttcttgatat attcagcaac aattttagaa 53 640 

tttgctcgag attgatgttc ttctttactc aacctcttat ttgactccat gatttgttcc 53700 

tcatacaaat aaattatttc ttcatttgtt gtaaccttca attcaactcc tccatttgtg 53760 

acaacaaatg attcacatcc acattacttc cacaacccac atattacccc gccaccafcta 53820 

ctttcaaagt ccttgtttaa tgatttcttc acatcgtcat cccaaaccca aaatctatag 53 880 

ttcttaactt tctacaatca caaacaaaca aaagcaattt aaaaccatag aatagaacac 53940 

aacacccaat aaagagcata aatccttaca ttccaccaag cacatcacca aaataacttg 54000 

tttggatgct agttggtttg tgatattaat taaattgttg gcttctcata gagacaatga 54060 

acaacaccac caaatccaat tgacgaagac aatgctccaa atgatgccat ccacgatgaa 54120 

cacgaacacc atgaacccta cccgaccctc ataaaaccac taagagccac ccctaactca 54180 

aatttgctct ctttgacaat aaaaacatag attatgctac tctactcgtc ctagtctcaa 54240 

gtagttcacc taaaaaacca tagataatct aaaacaaaac gataacattt acacaattta 54300 

ctatatgatg cgtatgggca tgtccaccga gacgaaccac atggatctcc cacaaaactt 543 60 

ttgataagaa ttcagtgaga aataaaaaaa tttcaaggat cggtcacgga attcgtatat 54420 

attttaagga tctccaacat gtttaagtct tgcatattta ttttgattgt tataaaagga 54480 

aaagggaaaa atatttaaat tcttaagaaa ataagtgatt atttttattt aattttttat 54540 

atttctttat acagtttgtt ttaaaagata aaataggaaa actactttca acttctcctt 54600 

agatatacca atattatact attctcccct ctccaattta aattaccaat catgtatgga 54660 

gtgagaaaac ctcgagcata tgagaaaggg gtatatgaat atactactat tcgtattatt 54720 

atcaaatatt aatatgttcc atagaaattt acaaattatt tatattctat tattattatt 54780 

tagtaatagt agtatagtat tagtaatgac attggatcgt aaatgtaaat taaatattat 54840 

atcattagta tttgtaggaa tgtaaaattt aatttaaatg aaaaatatta tatgacaaaa 54900 

gttatggtgc aatattgtgc ctggtgtgta tcagccattt ttttgtgtgg cttataacat 54960 
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acttttctaa tagatttttt ttttattgac tgaattttat taaaaaaata tattatttct 5502 0 

ctccaatttt atatactttt taaattaata ttaactaata ataaatcgaa cacgtgttaa 55080 

atagtgtttt gtttgctctt ctattgttaa aaatttactt ttttattggt aaaaaaatac 55140 

tactaaattt actttcattc taattcaaat taattaaaaa ttatgcccta tagcatatta 55200 

tataaaaaat ttaaaaaatt aaatgtttca attttttaaa aatagtttca aatcaagtct 552 60 

tactcaaata aatgtttttc ccatcttaaa aagatgcaaa atgcagtttt aatttaatga 55320 

aaactgaaga atgcatgaga gctaaggatg cattattcat acgtatgaaa tgacagaaag 553 80 

tcttatctgt tttcttggta atgagaaaag tatgacgaaa cgacctttag gagtaatttt 55440 

tttaaaatta taaaatttaa ataatatcta aaaaaaaata agttatagaa tatttaataa 55500 

tttctgagtt cataaattat ttgataactt ttgttttttt ttattgaaga tataaaaaat 55560 

tatgattttt taattttttt aaaattcaat tttaaaattc taaataattt tatttaaatt 55620 

aatattttta ttttaatttc attttatatt tttatatatt attttattta atgtttttat 55680 

atcattttat ttaatataaa taatataaaa atattaaata aaaaatagac aaaaatatta 55740 

aataaaacta aaaataaaaa tataggactt tgtatattgt tttttttcag tgagtatttc 55800 

ttacgagtac ttactgagga aataattagt taaagaaatg ggttttgggc attttgggag 55860 

gcaagcaagc ccatcttcca acaagcaaat tgtttcccct acagcagcac ttggcacaaa 55920 

gtgattgatt aatcgctgtg gttgtggctg tgaccgtcga aggcagcgac ggagtgaagc 559 80 

tccgtacaag ggtgttcaag ccacagaaga agcagatggg aacttgggga ttgttctgct 56040 

gcacccgtat tctattttgg gtggctgcca ggctctcctc aatggttaca ccgctttcac 56100 

ctttgacatg agaggcgttg ccaagtccac cggcttctcc caagttaagg atgttgttgc 56160 

tgtctgcaac tgcctttcca acactttctc tcttcccagg attttgttgc ttggttcttc 56220 

cgcaggtatg cttcattctc agatctctca cttttcttat actgtttaac atcaccccat 56280 

ccgctttttc tacctacaaa tcttacacgt atcattaaat ttctagagta ttcttttaaa 56340 

tttttactgt gaaaaatgtt gtcttttttt ttttttttaa taatctttga actgggcttg 56400 

ttgtttcagt atatcaaatg tctgaattgc ttgattagct cctagctgta tgtttcataa 56460 

acaaatttcg agctttctgt ctccctgatt acattttggt tattccggcc agccttatgt 56520 

caaatttgga aaggatgcct tagttacttg ttgcattgct gataagttta gtttgagagt 56580 

tttaatctta atgccttcca agtttgaaag ttgtttgatg tttggataaa cactccatgt 56640 

gaattaaaaa atgcaatgac agctctatcg actctcagtc tcaggtttta agcttgtagg 56700 
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agttgaaagg aagtatatat cttttgctct tggattcgat atagactgat ttgcagatca 56760 

acttgtttcc cattttaagt gttggaatga tatccaacta attatgcatg cttgagttta 56820 

catggactgt tagttagact tgtggtgtta atccacatgc ttcttgtctg tgacaatttc 56880 

ccccagttca ttcacgagct gctttaggag atgtagtaat tcgcgccatg tgttgcatca 56940 

atgttgaagc ctgaaagagt tgtcatatat ttgagttatt tcttttatta caggtgcacc 57000 

tatagcaggg tctgccgttg atcagattga acaagtcatt ggttatgtaa gcataggtta 57060 

tccattggga atgactgcct caatcctttt tggatgacac cataaagcca tcctacagtc 57120 

accaaaacca cttcattatg ggaacacaag atggattcac cagtgtaaag cagctgagga 57180 

acaagcttaa ttctgcagca ggacgtgtag aaacacattt aatcgatggg gtaggccatt 57240 

tccagatgga agggcctggt tatgatgctg agatggtgga tctcatcatc aagtttattg 573 00 

catcactata aagacaaata actgattgtg ctgagtgtga tttgactctg ctattgggaa 573 60 

ttgggattag tttgtgtaca aaatcttgtc agaatctgcg atttgcactg tttatttgtg 5742 0 

tcccctaagc ccatgccacc ttctaaaatg aaagaaatca gttacccaca tagtaaccat 57480 

ttctgctgat tggataatga gagagaatgg gaattgcaag cataagaacc atgcttttca 57540 

taatttataa tttacaaata acatatctgt tgtctctcta gattgaagct accctagatt 57600 

gaagctaccc gctacactac ttaatagata caaaaagaaa ctacaattat acaataatta 57660 

tatcatatta caatagtcat gccttgttgc caacttcgtg cggatactca gatggagcac 57720 

tagtttctct tggtaattta tcaccatttg aaccgtctgg tactctaaca ccactcgaat 57780 

tatactgttg tttcttccag tgtttttcct gatccattga ctggtcttgc gaaaagggag 57840 

gttctacatc tggctaaagc aagagagttc aaggaacata aagaatgaaa gagtcttgta 57900 

atcaataatc taaaatatca aggaagttat atactagcaa aagaacaaag aaaagaacgt 57960 

ccaaacgtcc acgctttatt cacagccttg catttcataa aaaagttcat tagatccccc 58020 

gagttgctgg cctactagca atatccaaaa attaagatac aattcaacag aatttggtga 58080 

aatttcaatt aattatacat gaacaatttt ctagttactt agttattacc actttcaccc 58140 

tcagacacat ctccgagcct ggtaattgca ttaacaagag cctgttcatc ttcctgcatt 58200 

tcaaagggag agccatcatc attttatttt attttttgat atagagagcc atcattattg 582 60 

ttgaaactaa agatttctca taacataaag ttgtatacta agaaaatgat tgatattcac 58320 

ttttagcatt ttctttgctt tctccatttc catgggatca ggatggcttg cagtaaacac 58380 

cttttcaacc tgttttggtt ccattctatt aggtaatatg aaggagccaa atatttgtat 58440 
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catacagaga agtagccaaa tttaacacac aaatttggaa atttctgttc cagatgcttg 58500 

tacctccttg ataagggtat ctgtatggag tatctcaata tcatttgtag ccttcttcct 58560 

gccccttttt gtgacaaagt gaaatctttc ttaggctgca ccttcatcat tcccctacct 58620 

cttccagaac cagtaactgc accaccgtga ggcatagact tcttcaatcc tctacctgga 58680 

ccgggccggc cacctttacc agatattcca ggatcttgac cctcccaacg aatatcttct 58740 

ggagatatct accatatcga atagatagta actaggattt catcactgat tgagttgatt 58800 

gttagcagca acaacacaaa ctttagtcta attgataaag aataatgaat gtgaagagac 58860 

actaagcatg taactatgta tctttgaagt agaggtgtgg tctgattttt aaagacaatc 58920 

atcccgtata tgcttgaagt ttcattgttt gcattaatta tagatacaca aaaaagaaat 58980 

caccaaagca aatgttgatc caagatagca aaaggacatt gactatatgc caatcccttc 59040 

agccacaaaa acatactaga ttttagtttt aggcagaaaa atgttcattt catcaaaatc 59100 

atgagatact taatacccat ttcaaggtat gggattcaaa gtatgagaca caaggtcacc 59160 

atgggaaagg taagatatgt gaaaacaaaa ggttctagct atgcttgtca ggagagaaat 59220 

gttgcttaaa aagaaaagaa aagaatgaac gacgctaaaa ctagattgta tgtgaatgat 59280 

agacagaaga atattaatat tgtaaaattg acctctttaa gattgaccca ctcccatgtt 59340 

tcatttgctg aattcatatc ataaaccaaa gcatgtagtc cctacaaaat atgtaaatag 59400 

aaatattaaa ggagtccatg cattcctcag tttctaatag tatactataa acacacctca 59460 

gcagcattgt tgttagagga acaagcttat gagactggat tcgaggccag cacctccaga 59520 

aggataagaa aaggcaagtg taagaaggaa gcttgataat tttcatatat tccatttgaa 59580 

ttacaattgc tttatttata ctagttcctg gcaaacagaa tttgtcattt tgtaccaaga 59640 

aatatttagg aaatcattgg cctagtgatt acggttgtga tgctgtcccc ccaagaccaa 59700 

ccagccacgc ttagcaattc cagagcaaat tccttcattc gaaccttctt catacgtagt 59760 

cactaaggtg agagggcctt gtgatgaccc tctttgcctt tcttgctatc tgaacccctc 59820 

tgttctgtgc ctgcacctct gtttgatgtt ctttgctagc tgatgatcct cttttcgtat 59880 

caattataat ccgttataac agcttcatag aagtggtggt cttcaggcca ccttgtccaa 59940 

actttccttc caatccaggg atcataagaa gctccttcag caggtccttt tgctgcaaag 60000 

gcacctgaag aaccccctga tcctggcttt ggcttgtaag tcctgtagaa gggaactgta 60060 

tggacatcat tgaggatgca caaatcaatg ggttttttct ttagaacggg atagaatgac 60120 

catagaggaa gtgtaactca gtattttgtt atcttgatat tccaaaacaa aaaggattac 60180 
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tgatttttgc tttgtggtct tagttcctga tggaggttca tgcttcaatg cggatgatga 60240 

tggttgcata gatggatgaa ctgcaggaga tggtgcaccc aaggataatg aagccacaga 60300 

ttgtgatgtc ttttttttct tacgagatgc caaaacggta gagctggtat tgtgatcatg 603 60 

aacctgttga gcagtatgta ctgttccaga ttggagccca tttccctttc tccactccct 60420 

aacagcacaa ttggatacac cccactttaa aatgcagaaa aaaccactgt ttgctcaatg 60480 

ctatacagaa actgctcact agaatatgat aaaatatagt atgaccttat cctgtggatg 60540 

atgtcatcag cattgactct tgatagaagt tctgtgttct tcatctgata ctcttagctc 60600 

cttcctaagc tctgtaatta aactttcttt ttcctaaagt acataggaac aaaaaaactc 60660 

attgccaatc ctatcgacat acttaataga aagacaaacc aatgaattaa ggcaaaggtg 60720 

cccaagtgat tgcatcagat tgagctttaa aagccttcag gacagaggtg tatgcttctt 60780 

gctcaatgtt atggatttgt gtttccatgt cgttctgcat ccgaggatat ggaccagagc 60840 

ctacaattgc agatcttccg tttccagttc cagtggcatg atcagggcgt tgaaatctat 60900 

ttcgacattg atgggaagga ggaagatcat catctgttcc tgtggttatt tcagattatg 60960 

gaaattaatt gcaagcaacc ttttgaaacg ttcactcata taatttccac atctgaaatg 61020 

ttcactcatc attctatgag atttatcttt atcaaagaaa atttaggaca gtcggacaga 61080 

tcactatcat ggtctgtact taacattgtg cacaacaatt atgcctgtgt gatagatatc 61140 

ttgtgttcta taacgatgat aataattaga aaggagcgac atagatattt tgtacgaaaa 612 00 

aaattgctgc cttgattgat ttgtgtgaat ttggtattgg ggagctggta gtggtactta 61260 

ttttgattat ttatggtatt ccccctgctt caaatgcttg acaaaaaaag acaaacctga 61320 

gaatatgctc atttcagtta tgaatcagac atgactcatt tccacttttt ggcaatatat 613 80 

tctgttcttt tcattctgat tttacagcaa gtgttcatta tcaatgacat ggtctgcgtt 61440 

tggactaaga tgacagatca cgaggaaaaa tgcagctagt tcatgtcttt tgttcttgct 61500 

attttcctga cttgttctca gttagaaccg ttttctttga aacactataa ctgctccatt 61560 

atttagaatt aaaagtaata agaaaattac aaaatcagcc ttgactgtgt tcatttctag 61620 

gatttaggat ataaataagt gacaccaaag ttcaagtctt ttaagtgaag acctcaatct 61680 

atgcataata aaaaaattag gattagcata aaggagaggt caaacgaagt tattcaagtc 61740 

agttgctcaa cacctgcata attttgtaac catacataca atataagcct gcgaagaatc 61800 

accagtacat tatcaaacaa ttcctaactt gaagttaact aagcaatttg ataaagaaaa 61860 

ccaaaatcct aatttccaaa ttaaggtcca aactcattct agtcattatc tattcaaaca 61920 
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ccttctccaa catcacaaag cagctacaat tattctcaat attgagtcaa catctcaaat 61980 

gaaaagttca caaagttgca gtttcaggca acaaagtttc aacttacaac aataacagca 62 040 

gcagccaaac ggtttgttga ggaataagaa tagcaaaagg gctatttcag acaatttcct 62100 

caattttgtt cacacggtca ttttaatatt tgaaattata atttattaga ggccgggtct 62160 

agccaaaaat attagtcccc ggtacatccg catttaaacc ttgaataaga ttgtcttttg 62220 

aagaataata cttataagac actcaacgaa attatttaaa atcatttgat tttattatta 62280 

tgtatttatt aattttacac aaatagtgcc aactaatgta tacccaagaa tctgcttgga 62340 

aaaagttaaa tgtatactca attttatttt gtatgcatag agagaaatag acgaggaaaa 62400 

atataaaaga aaagaaaaaa attataaata aaacaaacaa cataataaaa aatgaaaaga 62460 

agaatagaag gaaaaataga ctcaaaatga gattttgaaa ttatataatt aataatccaa 62520 

ttaatctcat gagatattat tctataacta taacttatgg accgtttgga agagataaaa 62580 

aaagaaaaga aaatattgat tacaatatat ttttctttgt ttaattgaag aaaaataaag 62640 

aaacaattga cttctaagaa taaaaattca aaacttttct tatgttttaa ctttttttta 62700 

atttcaaatt ttccttttct tgattttctt ttcccactac aattgactga gtcagactga 62760 

acgtgatact tagatacata catgcaactc tcagtactac tcctcacctc tatataatct 62 820 

tcactctttg ctcctccttt tcttcatccc tcagaattaa cctcttctct tctttgtgcc 62880 

gaggtattgc tctctgtctc cccccatttt ttcctaataa atgcactatt actccataca 62940 

tttgctcctc atctctcccc actctttaac taataagatt tttcctccgt tgtgtatgtg 63000 

tatgtgtgca taactcgatc tcttgatgat ttctaccaat tggcctcggc cttatatata 63060 

tatattctca ttctcatata tatatatatg tgcgtgtagt ttacatgaat tttgcaggtg 63120 

attattattt gaaagcaaat ttatttatgc tgttagtaaa gattgcagca agattttgcc 63180 

ataattaaaa atctacttta accccattgc cctgcatgta cattttatta ttagtttgtt 63240 

tgatgatttt ggaattttgt ggtttaattt attaatatat atatatatab atatatatat 63300 

atatatatat atgtatatat tttgaaggaa ttgaggagcg gagtatatac gatgcagttc 63360 

ggattaagat ccccagaagt accagatgag aacacaatct cgcgtggaag catcactata 63420 

cagtttagct tccatctgag tgcactgagc atgatggagt atgagaacca tcggagagtg 63480 

agaacactga gcacaggtcc cttcacactg aaaatgataa tgcaaataaa ctgtcccatt 63540 

cttgctcata tctgttgtgt agataaattt aagtggcctc taacaaagga cactgttgtt 63600 

ctccgagtca ccactcgcac ctatgtcatt gccctccctg gcctcttata tgccctccag 63660 
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ttcccaagac tttgctctcg agaacgtctt 

ggtcacttcc aggatcttag accccaaaag 
tttatttaaa ttttcgttac cattgacttt 
tacttttaat aacattattt cttttgacta 
ttacttagac cttcaacttt ggattaaaat 
tctaatttat taataagaac tgctacacat 
cctagaacta gattaatacc ctctcgggaa 
tgcagcaaag aaaggtaata gaatcataaa 
ggttagctaa ctctcagtta cctgattccc 
ggactatact aaagaactgc tacacatgcg 
aaataaattt gtactaattt tcttcccttt 
gacatacgat cgaccatttg tttgtcctac 
cttgtcctaa ttatatatat gaatgaatga 
attacatgca ggtgggtgct caatgcagtt 
tcctgatttt gagcccatag cacatgaaac 
ttcccgtttc tgcatggaag agcagaactc 
gatgacaaag atgataatgg aaagcatgcg 
gatcaaaggc accaacccac ctgaggcaca 
ctacactcgc cttgttgatt ctgctgaata 
ggaattggag aatgttgtgt ctcctcttcg 
ttggaatctc aatcacttgg gccttttgtc 
agaggaagtg aaatatgtga gtgctagaga 
acaggaagaa gaagctgatg aggcctctga 
ttaaatatca atacataagt ctggctagat 
tgttcgtgta ataatttctc tttaacaaga 
tttttcattt tgatattatt atatattctc 
atatgaattg attgattttt taaagagtac 
agtgaccata tgggatgacc atcacctagt 
atgataactg ttgggaaaat ggaatcggtc 
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cgaatcctct caaatgtctt taggaaatat 63720 

ctaggtatct ctcttctcat taatttcaac 63780 

tattgattac tatttgatta tttgattgac 63 840 

caggttatat ctatccataa cgcttgactg 63900 

tttaaaaata atttttgaat ttttaaaaaa 63960 

ttttttgcgt gaatgcaagg ggcaaaagcc 64020 

aaccgtggaa tttacattag atataacagc 64080 

ctctcggatt ctcaacaaga gacatttcat 64140 

ttccctcaag acaccatgat tgaaggcact 64200 

tatttttatt tttttaaatg acctaattac 64260 

tgcgattggg tgcaaaatta aatttattta 64320 

atcctattat ttattttgag aggatatgtc 64380 

tgacattgat tataattaat aattaattac 64440 

cgatgaccca gaaatgttgg aagtgttacg 64500 

ccttttggag ttagggaata tcccagggat 64560 

gagcttccta agggcgtgtc gcatgtcggc 6462 0 

tgtgaagaca atgacaagta gtcttgtgga 64680 

ccaaggaaga gcctatgctg tgatgagtac 64740 

cgcttccctc gtttcctgtg gcttagttga 64800 

ctttggttcg gttgatgttg ggtaccaggt 64860 

tttcatgtct ctcttgggac cccatgttga 64920 

agagttgtgg aatgccatgg ctgaaactga 64980 

tcatgatgaa gataagggcg aggaaacaaa 65040 

catgtatgag gtcttggatt ctcacattta 65100 

gatcgagggt aaaacatatg tcttaattaa 65160 

tactattata atatgatgaa gaactaaatc 65220 

aatcgtatat atatggagag agaataggga 652 80 

aataataagg caagctaaag aaatattatc 65340 

ggaagcatgc atgtggaaca gatgaaatta 65400 

38 



WO 01/51627 

aagctagctt ctcaggaata atatgaaaat 

tgaaaattaa ttagtattgg atatattttc 
aagaatggaa taaataatct tgtagaaggt 
agagggctga aaggctaaag cacgatatta 
gtacatatat aatagtattc tgtttcttga 
acacacagaa tccggtgttg tatgactaga 
agtattgaag cgaatatfccc tcttagctgc 
gaaatggcta agatgctaaa catcatatag 
tgaagaagac gtccaatata tttaactttt 
ttctcgatat atcttgaaag gtttttaaaa 
gtttctaaaa ctacaaaaat gttgtataca 
tgattaaagt tcgctgatat aacttttatt 
tgcaagtctt tattaataat acaatacatc 
ttaacagttc aatcctgcta gataaatgct 
aatgcttttt taattttttc ctgtttgtaa 
gacatttttt tcactttaat catgggctaa 
taaggaagtt taacttagtt tgtttcaaca 
atctgaattc ctacatatta aaaagaatat 
tgataaaaat aagattatgc aagatttaat 
tcttttaaaa ctatataaag tcgtctattt 
aatcatgtgc ataaataaat tattgatctt 
agtttttctc agtatccttc tacctctagt 
tactctctta attggtattt tactagtttt 
ctttactatg atttttctaa caatatatta 
tatttgtaat atttgttttt attaatgttc 
attaagatac tcattcattt catctatcat 
ttctcattct atttgattga tggatccaat 
acaacttata atagtcatac tactattaac 
aagattacta tcttaggtgt agtgtatatt 
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ctatgtcaat atgctagctt ttcgcaatca 65460 

gatatgtcac aatacaataa tgcaggagtg 65520 

aagtcagtca -ttgaaactca catgtggtag 65580 

tatacacgat ttcagacgag gagcaagaca 65640 

atttccagaa aatcaaagag ccatcgagga 65700 

tgccaaattt gaattactaa atgcacgttt 657 60 

cccgaaatga tgaatggttg gaacctggtt 65820 

gttattatat ctggtcttat gtttgtcaaa 65880 

cccagctagc tgctgtaaca tatattgttt 65940 

attcaaagtg attatttcta taaaaaaaaa 66000 

ttaatataat taataatttc agtgttagat 66060 

taatgaaaat taaattgtta gtacaaatat 66120 

atactaacat atccaaaata tttatactaa 66180 

tttctatata tacactgaca gaacagtttt 66240 

taatgatttt ttgtttgttt gttagaaatt 66300 

atcttttata gtaaacaaag aatataaaaa 66360 

atataagtaa attattgtaa attctcttat 66420 

tacaaaaata agatgtatat taatttgatt 66480 

taataaagca aaataataaa aacaattaag 66540 

gaactaaagt tatcataaca ttttatggtg 66600 

ttaataatta tttttaatgg attcatcaat 66660 

aatatggcca tcatcatcat ccttttgatc 66720 

tacatagatg tcatgtctac ataactttat 66780 

tcttgtatat tcactcattt agtgcatcca 66840 

aagacactcc ttttttgttt tttaaagtta 66900 

gcttaaatcc tctagttaat ttacatttct 66960 

acttcagcat gatgcatcgt attattggtg 67020 

aaaagataaa cttaacgcct tgtgcaaaag 67080 

ttgtgttgtt ttgtcattcg gaagataata 67140 
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ataacttatt tttatcttac tttgcgacaa 

caggtatttt gagatgttgc ataataatat 
ttttggaaga aaaaaagttg aggattaatt 
caatacgttt tcgacttgat cgataaaagt 
cttttttcta atggtgtatt gatatcttta 
gttagacttt actttcattt taataaaatt 
taataagtga catttgtttt tgtacaataa 
atccaatttc atgttgaatt taatattgtg 
atgatggtcg atggtgcaag aggggatcgg 
ttgtactcca atctaataaa aatgttctat 
caggaaaaga aaatgaatat gtaagtatga 
aatcatggat acaaagcagc aaagccctct 
catgcatgtt acaaacgact aaaagcccac 
cttctatata gtaactaaac aaggtttttt 
atcatttaaa acttaatcac acaaaacatg 
ggtgggcaag tgaaaatcca tagttttggg 
ttatgcccta aaggcattat cggatgtgac 
ctttcccaca ctagaactag ccacctttct 
tctttccact tccattctct tcaacttctc 
agctgaacgt gccccaatgg aagacatgtt 
attgcaaaaa agtttactaa ttattagact 
cattgccaaa gatggtattt ataggagttg 
ccacgttcat ttctccaatg cccttgtggt 
taacattaat tatcttaatt gcactcacac 
ctttgcatgg aaaattaagt tttggtccgc 
aatatataag atagacaatc ctaaggctcc 
taagtattta aaatacacat ggtttggatc 
cgaagtgggc ttcttagttt aattatgtgt 
gatgacatgt cagcttgaga tgataatata 
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ttcaaaatct ggtatgtaat ttgtgtatgt 67200 

ttccgcccac tatatacaat ttaactctct 67260 

gcaacaacta tatggtgatc cttttaactt 67320 

tagatcataa acataattta aggtcatgtt 673 80 

tttatccctt taattttact ttaattgtaa 67440 

ttactattaa aaaaatataa acttatccct 67500 

ttaatgtgtc ttattttaag tggtggttag 67560 

gaacaaaaag gcggtgttat atatatatat 67620 

aagattatac ttaaagaagc agaagcatac 67680 

atataataga aaatacggat acgtgctgta 67740 

ttcggaatac atatgcacca tgacgctttt 67800 

tttgacataa caacacattc aattttgaga 67860 

gccattaata acatcaacag tcttttatca 67920 

ttcggtgagc ttaggggagc tagtcaaaca 67980 

tagaggtata gaaaaaatac aaaaggtaat 68040 

gagttatttg tatcccaaag tcgaaaacca 68100 

tccttgttca ctacctggat gcaccttgtt 68160 

ctcttcagta ctcacctcac ctcttcgtac 68220 

tttctgcctc ttctccataa cataaatttc 68280 

gttctcfcgca aaaacaacaa actagctaat 68340 

tggctactaa gaaatgcttt atgaaattag 68400 

gaatttgatg agagcgtgtg gaataagcag 68460 

ttgtggcttt gctttacaac caaccataat 68520 

aaaaaaataa aatgccatgt tgtcatcata 68580 

gcaccatctt tatgtgtttt aaactttcta 68640 

aaatgcaaca atttgtatta atatattaca 68700 

acacaggaag tggattaaat ataaaatgac 68760 

catccaaaca ttaaccacta ttgccacaca 68820 

ttctgtggga ggagttggag gctgcacaat 68880 



40 



WO 01/51627 PCT7US01/00552 

ggcagatata ttgcatgccg ctgactactg gtaagttacc tccaaaataa ttgacctata 68940 

atattgcaaa ggccattaag ttataagatt caagtgtttc aagcatatgc agtaacctaa 69000 

aacgggttta tatataacga atatataatt tctcatcgcc gagtaacaga cgtacaagaa 69060 

tgagacagag agggaatgta agaatcattg gtacttaaat ctttgattac atcgatgaac 69120 

aagcgaattt caggttttag acgggatgct ttaattttgt ttattattaa tattagttca 69180 

tacacgggac actttaattg atgaccacta atcgtggaaa ctgcagcttc tgtattagcg 69240 

ttttaggtag gattttaata ttatgcacag caccaactaa gcttttagac gtatattatt 69300 

aaatttatat cttaaaagca taagccatta actgacagct aggttatcat aaaaccagga 693 60 

gagcttatct ttcaatctaa cactttgatt gcatggtatt tctcttcgat ctaatttaag 69420 

ttcttctttt tcttcagtga ctattaaata aattgtatta aataattata actcatactc 69480 

tcgatcaggt caacttagcg aaaataagat tggatgatag taatatatat gttagtagtg 69540 

tacatcgatc tactctaaga ctaaagagcg tgtaataata taactaacca gttgataaga 69600 

ttcatatttt acaagaggtg aatttaaatt cgaatgtgta aatgtaaaaa cttcgttggg 69660 

gagtaattta aaaatactta tgtaagtact ttgagtttta agatctgtta tgatccatgt 69720 

aaactttcat tactcaactt acttcttttt tttagtgaaa aaaaacatta tattaatagc 697 80 

atattttcaa ggaaaatatt tatcatgaat ccacttactt tctctttcat ctcatttcta 69840 

ctttcatttt cttaatctca gtttttttta ttggattacc tattatatct atcattttct 69900 

ccctttattt tttttttctt cttattctct tttctttcct ccctaacaca ccccatttta 69960 

ggatgtaagt ttatgattac tttattttca gatacatctt acaataaaga aaaaaaaagg 7 0020 

gataaaaaaa accatatatt actagaaaaa acataaataa agatggaact tagagaggga 70080 

tcctcatccg tctttagtag agaccaaaac agtgacaaaa tgcaatatac agaaatgaaa 70140 

caaaaatatt ctaccttaga gacaacataa ggatatatcc gactctataa tattccatcg 70200 

tctctatatg tttaataata tttggacaca gattttagtg acaacattag ttagtttggg 70260 

ataattgtca acatagccca tctctaaact cgtttctatg tatctatttt aatattttaa 70320 

aaattataat tcattagaga ctgaaatgat tccgtctcta agatttttta tattaccatt 70380 

tttatctatt caactagagt taaagtatat gctgtattat agtttgatta aaaataagta 70440 

ctcaatatta attataatat tatataatta attgggaact gtacacatgc cttaatataa 70500 

tattatttta ttatattata tatgagtatt tatatttttt taatgtgatg gaaaaaagaa 70560 

gataagaaga aaaatcaggt gagatgatac aagaatatat atatataata attccaaatg 70620 
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aaataccaca aacattatat tagatacatg 

tatatatttt caatttaagc ttattgttgt 
agagatgcaa gttgctacat atggtcgata 
tgttggcaac ccaacatatc attggtgatt 
tatccgctct accactttct aaacaaaaat 
tgcattaagc ggtagattct gcatacagta 
agcaacgttg ctctcgtaat atcacccgat 
aaaaaaaatt ggtgatgtaa taatataaaa 
taatttactc tttctcgcat agaatgggaa 
taaatatgag tggacatggg aagatgtgat 
ataattttta taacttttta atttataatt 
cttttttttc aattcagggg acttgtgctg 
tttggtgtca aaggaaaaga atcagaccag 
acgaagctac tgtatgcgtt ggggaatagc 
gtcattccag tcgcagtaac gaaaagaaga 
tatataatca tatcgtataa ctttcataac 
tatttaatat tttcttaaaa tttatcatta 
atacaaaatc taatattaat ctaaaattat 
atctatatga atattttaaa atataaatta 
aataaaattt gattcaaatg atagtaataa 
taaaaaaaat atattatata tatatatata 
ccgactttgt ggctatatat gtgtgtgttt 
aaccatatag cagtataagt gttttgtaafc 
ctttttaata aatttgatcc attagctgtt 
cgtatcaaat tttcttatat aaattatgta 
aattacaaat actttatggc aaatgataaa 
aaagtaaaaa tatttttatt agaaggaaaa 
attcttttag aatttacaaa ataaatttta 
tccatactat atatatatat atatatatat 
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ttgacatgaa acaagtcact ccaaaaataa 70680 

aaataataaa aacttgagca gaagaattat 70740 

accatctcaa agaaatcctt ttggtcagtc 70800 

tggtgtcctg tctagtaatt tgagattgtc 70860 

aaaaaataaa aactacctca acaaaatcaa 70920 

gtatagcact accaagaaaa aacttcatat 70980 

attatttata ataagaagag aaatagaaag 71040 

agaaacaaaa aatgaataat gaaaaaaata 71100 

gcaaggacct cccagaaaat tttgaaacgt 71160 

aaagatcgat ccgaatagcg tattaggagg 71220 

gttacgaggg ctaggtagtg actttttctt 71280 

ttgtgcagtg tgctgcaaaa actggagttt 71340 

aatgaatttg tgcctagcta ccctgtattg 71400 

gcaaataaaa tgattttttt gcaacttcga 71460 

atatatagga aggttcatga tatatatata 71520 

tattatatta ttttttaact tttaattaaa 71580 

aattcaaaat tcattttata taaattatat 71640 

ttattatttt gtttatatag attaaaattg 71700 

ttgattattt ttttattgtt gttcaatttt 71760 

tatatagata taatttaacg attagattaa 71820 

attataaaaa ttatatttgt ataatttaat 71880 

ttgttcatgt tgtatttctt tctttctgca 71940 

gtacatcggg cacaacttgt gcttcttatt 72000 

ttttaaaaaa aaaattgttg aagataagta 72060 

tcaaacgaca taaaaaatat gaagcagtaa 72120 

cgatatctta aaagtattat taaaaaactt 72180 

aattatatta tttacaatat tttttttatc 72240 

atttcttaaa cagatcactg gtatgcaaaa 72300 

atatatatat atatatatat atatatatat 72360 
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atatatatat atatatatat atatgataag aacctcctcc cttactactc cctatgcatt 72420 

agttgttgac taacgccact taccaccaat ctcaagtgct tctgactaag tccaattaat 72480 

aaaggcacaa agcgccatcg tgccccgcac aatttatgtt aaaaatgaat acaaatcgaa 72540 

aattagggta aactgatgta atttttagtt tgtggctgct caaagactgc gagttatata 72600 

taaggatgaa gaatgtgcaa tgcaacttcc caagtcagaa aaattccatg caatcagtct 72 660 

tcttctttga ttttattttt cttttgcttc tttataacgt ttttgacttc cgtgaagtgg 72720 

caaaggcaac atattttgtg tattttattt tttctagtca gaattgagat actttttctt 72780 

caacaaaaaa tgccattcta gtattattcc aatatttata ctgtagtaat tcttaccatt 72840 

tcgtccactt aatttgaccg gctacccatt cctgttctaa tttggctcaa ccaccgttcc 72900 

gttttaagaa ttttgatcca aacgcgcacg tcattccaat aattttataa tcttaattta 72960 

attgttttcc tccttctcga ctgcgaccaa agtattcaat tcttaaagga ttgataagac 73020 

aaaaagagta ttacccaaac aataatgcac ataaatagtg cttagcacct aagtcaacac 73080 

tgtacaccta atcacttgtt tttaaatcaa aaaatatttt ttttatactt aaaaaaatac 73140 

cgtacattgc acgggtttgg atggtagttt gtatcaaaca aattaagcaa agataagtct 73200 

aaccgcttca cggggcagaa cgagaaattt ttgtattaaa gtcatgtgtt tatatgtaaa 73260 

aaagcaaaaa aagtattagt gcattagttt agttgcattt agttagcatc aatatgaaac 73320 

ctctctctaa gctcttaaac aagctttgtg ggaaaatgat aatgctatta gcatcccttc 73380 

taatagagct tatcattctc attcagaagc taagggaatc acgtcccatc agcacccgcc 73440 

aatacatcaa actcatcgag aagaagaacc ccacaatttg ctacaccaaa agatttaact 73500 

tgaaggcaga gcacgctaca gagtgcaggg tatgcttgtc cgaattcgag caaggggaga 73560 

agctgaggaa gctcaaatgc caacacacgt ttcataggga ttgtttggac aagtggttgc 73620 

aacagtactg ggctacgtgc ccactttgtc ggaaacaggt gttgcctgat gatgttgtgt 73 680 

tcaagcaccg tcagcatcag aatcaaccag aggctgcttc taatgggaat catgacaatc 73740 

ttctctatct tttttcagca tttcgtggtg gcaacactta atttgcacag ataactctct 73 800 

ctctctctct ctctgaatta cccaatgtct tcaacatatt tatagtagta ctatatagtt 73860 

aaaacaaaac cacctaatta ttttttcttt ttagaaaagg gtttaatttg ttattcttcc 73920 

cctgccagga ttctacggag gaatattaat attgaagtac accctataga tatagatatt 73980 

attaaagcat cgatgctacc ttttttgtat ttcactcttc ttctgcttat tatttggatt 74040 

aaggtttgac tgatctagaa attcatattc atatgtctgt gtaagtggac attttaattt 74100 
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tggggttaga aacatccggc aattgtgatg 

gttggttggt caaatgaggt agtggagtct 
tattctacga agaccctttc ttagtatact 
agattttttt acatccgttg acgcctttag 
atatattaag tgatatgatt taataggaaa 
gtcaactagg tatttaaaat ttataggtat 
tttgatgtcc agaagcaatt cttttggatt 
attaaaaaca aatattaatt aacgatatat 
aaaagtgaaa aacatagaat tatacagtta 
ataggtttag gttgtaattt gttgaaaaag 
acataaattg tgcttagcac caaaagtcag 
tagcaaaatg atacttaaat ctatttaatt 
ttaacatcct aagttaaata tatgcttaaa 
gataagttgt ctttcttcgt tccaccagct 
atagctagga aagacataag tcctcatcgg 
atagacgttg ccatcgcttt cgatcatttg 
aatatgtagc ctaggacatt aattattcaa 
tatcatagga cacaggggat gcatcaaact 
tggtttttcg atttctgtag cagtaaagtg 
tttcataaat aaatttgatt atttattctt 
ataagaatgc atcaaacagt gttgaaagtc 
tgcaattagc taggttctta actagtctct 
tgggcaataa accaatgggt tttgatgtaa 
tactcttcac atgcatgtgg tcgatgttgt 
tttctataag ttttcctttg cgtattggca 
gaactttgca agtacatatg atgcgtgttc 
aaattgttca tccaaaatga attatttgct 
gttcaacgtt gacttatttc ctctacatgt 
tagtacttac acaatttttt tataatatta 
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gaggaaaccc aacttcctac tgacggtaga 74160 

ctcagacaca tattattcat attctactca 74220 

gataaattgg taggtgaata agttgaaaaa 74280 

agagaaacag agacaagaag aaaaatatag 74340 

aaaagagata tatagaaaaa aaataagagg 74400 

aaaaaaatca ggachcgaat aagtttatgg 74460 

ttcataaaag ttgctttaat gttaatatgg 74520 

taaaatgaaa gaaagttagt gaattttctc 74580 

cacatgcaac ataaaaaaaa ttaaaacaat 74640 

attatgtttg gatcttctat gactatgagc 74700 

taaaatttgc ttaaaaaagt atttcaaaaa 74760 

gcataaatcc aataaaaagg gttaaagaaa 74820 

aggtaagcgt aaatggtaat gataattatt 74880 

taaattaact taggtcatgt ttgtcttcac 74940 

tgatctcaac ttaacagact tgacttagaa 75000 

ctaatcatgt ggcagaacca aattaaggag 75060 

tatggttgag aggtagatgg acaaattcaa 75120 

gacacttttt gggatcaatg aatcctcaat 75180 

cgataagaat taatttgtca atcttcatat 75240 

atattatgat ttgtttgtca atgaaacaat 75300 

gtaggtttta atggtgcact cctatcatac 75360 

gacattttbc aacacatttg cttggacagt 75420 

aaaaccaatg acaacaactt gaagtaggtt 75480 

attattgatt atatcaaatg ctattacaat 75540 

gaagaataat caagaagggt tctcagttat 75600 

cctaatcctt gattacagac atttcccctc 75660 

ccttcttttt ttgagttgaa agagacataa 75720 

attcacatca catacgtgat aaatgaattg 75780 

tttttatttt tttctattac atacattaac 75840 
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tatattatca cacgtcccaa tctcgctctt 
ctgttatgaa aaattgtata aacataattt 
actgttatat atttagatga ataattataa 
ttataattaa ggagctaaaa atcactctta 
tcaaaagcta gttttgatac gatgttaaag 
aaaatcaatt aataaaggga tgacctatcc 
ccacccgata cggctctcaa cattcatctt 
agtttaaaca aattttgtgg acaatgaagg 
tcgtagtaga gtcagtacta gtttttattc 
tcattccatt gcatgtatta aaatactact 
ttattatcat agtgggagtg gattagtggt 
ggttaagaag tgtcggtcaa gtagtgaaca 
ccacccacta actcttgcgt gctccttgtt 
gttagtgttt aggatattgg ttaagaaatc 
aaattataag agaatataaa aaaattatga 
aaattttaat taaaggaaaa ttggttatat 
tgataaatga ttttgatcca gtatatgtcc 
tagctcttca agtactgatt ttcaggtact 
cagtttgtta accatgcgtc aacacagtac 
tattcgtcat tggttggaca agaactgtga 
aaccatgaac catggatgct gattttttta 
gagttcatat ttattcctaa accatcagaa 
ctaggcctgg tctgttcaat ttgagatact 
agaaaaataa ataaattatg agtctctgga 
catccacact ccaatctcaa taataacaaa 
gtctcatttg ttggtgaagt cgctcctcca 
aagcgtagcc aagtgtccag gttatgtgtg 
aacaatgcaa aaattgtccc cttaaaggca 
tttcaaaaag tacttttttc aaaatacatt 
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ttctttcctg gtctctttat tatacatatt 75900 

gtcgtaataa atttcaaatt acaaattgta 75960 

tttctcatct ttttttttaa ataatttttc 76020 

gaatgaataa tgtgagatgc ctacccaacc 76080 

tgcagcaaag acatctaaaa gattttttta 76140 

aattatataa acacttatca tattcattaa 76200 

agaaataaaa gcttcttgta tatcaccaaa 7>6260 

atcattggat aatcactgta agcaacttgg 76320 

atctactcga ttatactagc aatttctatt 76380 

agattatatg aaaatgaaag tcactgggtt 76440 

tgaaatggta gctcaaccga tcgatgtgac 76500 

caataattga tttacaaaag catatgtact 76560 

tcaaacaatt aaattaatat aagggtattt 76620 

aatatatata tatatatata tatattgtat 76680 

atagtataat tttttgtttt tcaattttta 76740 

ttaaaattat tctttatgta gaagaaaaat 76800 

caggggcaag gggggaaata acagaggcta 76860 

ttgttggagt gtgattctaa aatcgtggtt 76920 

tcttcatctt tattagtcta tcgttacctc 76980 

ggttcatatt gtgcaagttt ttagaaaaac 77040 

gcatcagaag ctcaatttta ggagatatag 77100 

cttgattgac gcaatattaa tgagacttcc 77160 

ttctttctca atttttttag aagaaatagt 77220 

agaagccgta ggaatataaa cttgtagatg 77280 

tattccaagt gaagtaatta ttgaaattat 77340 

attgagatat gaaattttca tgactgtgag 77400 

actacgatgt cactgacatg aactatctca 77460 

ataccccctt tccccccctt aaaatttggc 7752 0 

aatttttaac atggctagaa agtatttttg 77580 
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aaatattaaa attttaacac ttccagaaaa 

tctgttttga aaagtagttt ccaaaaaaga 
tatagtatta gtaaatctgg gaggtgtaaa 
taacttggtc cacgatggat tggattttaa 
atggcaagaa tttttaggtg agcacactta 
tccaacttca aatttcaaac aataacaaat 
tattgagtgt tgatcattga tttaagtact 
atatactatc gaagaagtga ccagccataa 
aggcccaata tcataaaaga acgggtcgca 
aggtggccca gcagtacaca aagagacaac 
gttcacaccc catattttct tttatttttt 
tattatgtat tgtacaaaaa aatataatat 
ttcattaaaa aaaattaaca tgactgaaac 
tgacactatt attgtgagtg aaaaaaggtg 
gaatgtggcg tgagaagaaa gtgatgtagg 
gttactgtaa atgtaattgt gagaaagtga 
gatgggtgtg gtgagtgaga aagagtgtgt 
tggtgtttga agagagcgac ctgggttgtt 
tttaaaataa tagcaataca acggaaacat 
aaaaaaaata ataaaacaaa aactagttct 
tgtattaata agggttatca tacattataa 
tattttgatt ataatgaatt cacaacacaa 
ctgtggtaag aggtggaggc aatgggaccc 
ttttaaacct tcctaatatt atgtataatt 
tcctttggtt ttggtcaaga tgtgaaatgg 
accaggtttt acaattataa gtctttaggc 
tctacactac ttgagttgtc aagttgtaat 
ttaaaaaaaa ttaatggctc tgcaaattgt 
cctttttctt tctaatgtgt gtaaaggttt 
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tattttctga aatgtgttga aattttttaa 77640 

gatacttgta gaagaagaag aatatatata 77700 

taacagttgt taccacctaa tcttaggctg 77760 

tctcaaaggc ctagtacagt cgacagatat 77820 

agactatgtt gaatatatgg aataaaattt 77880 

aatccatgaa acaaacgact cctatactct 77940 

tacaattaat aatacatatg aaaagaatat 78000 

aagtgttgga tttccataag cctctctttt 78060 

caggtccaaa taaaacaata gtgtcccaag 7 8120 

aaggctttgt tattctattc tcccatccaa 7 8180 

taacaaaaat atccttgaca gacacatttt 78240 

aaaaaaaaca agtatttatt gtgttaaatt 78300 

ttattataac ttttttacaa aaaagtaaaa 78360 

tgttgttgta attgtgagaa agtaagtgca 78420 

aaaatggact tggtgagtga gaaagtatgt 78480 

gtgtggtgtg agaagatatt gatgtgagaa 7 8540 

ggtgctgtaa taattgtgag aatgtgagtg 7 8600 

gagaacataa acttcttttt gttgttttag 7 8660 

ttatgttttt gttgggttaa atatttttta 78720 

attttgttca atcaaaatat atttttgttg 78780 

tagaaatata ttttcattat aatagaaata 78840 

cctagttttt aaaataataa taaaaaaaag 78900 

ataggggcca tggccccctc ccctacagat 78960 

aattaatact acttttgatg atattatttt 79020 

tggtgcatga gtgtaggatg tgcaacacca 79080 

atttgatgtt ttcagactct gtttaattgg 79140 

cattgccatc ttaataaaat tcactttgct 79200 

aattatttac acctcacata tgttctcatt 792 60 

ttgcattgca tttcattaac ttgtagaaaa 79320 
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actagttttg aacacataaa acaactagtt 
agtttttgtt tttactaaca atacatttat 
caatttttat aatacctata ttcatgtaat 
tttgttgtgg ttaatttttt cttttttttg 
atatagtaat tttataagag aaaagaaaga 
gcatttactt caaatttaga tactatgttt 
ctttgataat aatgagatga gtgacttatt 
ttttgaagat aagagatgct atcttttgtc 
tttattctct tctatttttt ttgttattca 
attggttaat cttctttact caatataatt 
ttaaatttct tgcatatatt atgcatgtga 
gattttttat tagcccctta ataaaatcct 
taaacttaga gaaaatgtgt tgagaataac 
tacgcccact ctgctaaaaa agggcatact 
ttttcatgtc ctgctaattt cttgtttaat 
ttacactaaa atttatttat atcaatacat 
atgtacgtgt gagtaatgag tttctttata 
tagtttaaaa attacaaaac ataagatcag 
accggagatg ttaaacgggg aacacatttg 
aacacagaga aacatagatt acaaaaggaa 
agccaactat tgcaaccagc aagctgaagt 
agcaattgca aattgggttt tgggttccaa 
ctcaatcttc tccttcagaa ccaacaaaga 
ttgtacagcg ccagccttct taaaattcat 
catagcaata tcagacatgg ttgatggtag 
ttgtttgttg tggttaaggg ttgctcttac 
tagcgacgga ttctacgtct ctacgcgtgg 
cgtgttaggt ttcgattatt aaattattac 
tcatatccat taatgtgtcc ttcgttaaaa 
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tcttttaata ataatgctta ttagatttat 793 80 

atttagctaa catgtctaac tgttttctgc 79440 

gaaactcctt tttcactaaa ttaatttaaa 79500 

gttgagagat gtttaagaag aaaactatac 79560 

gatgatgaaa aaacacatat ggtgtctaat 79620 

ttattgacat tathggattg aaacattata 79680 

gttgattgta tttttgaagt ttcatctcta 79740 

acactttttt taggatattg ttgtcagttc 79800 

tcattgtgtg catgtgtatt gttggcactt 79860 

atatatttct tctgcttata agaaaatatg 79920 

aaatttaatt gttttttttt tctaaatcaa 79980 

ggctccaccc ctggctgtgg tgtaattgtg 80040 

acggttcagg ggacagtaat gcaaaacatg 80100 

aaaacaggtc tgccatatat ttacttctgt 80160 

caaaatatct atactaattt cctatagagt 80220 

atccctccaa ctccaagagt acatgtatgc 80280 

accttctaga atcccagaca atctaatcaa 80340 

atatgtcaaa gtatatatat aaaaaggggc 80400 

cctttaattc aaaacataac tatgcacagt 80460 

aaaataataa aacaacaagg tgccaacaac 80520 

tgttcaagca gaaaagaaac tgttagaagc 80580 

aacccacttg acaagctcct tcttctgcac 80640 

agcacaagaa accacgttca agttcctctc 80700 

tgtcctcctc tcatattctt ctgctaccaa 80760 

attctctcac tcactatata tgaaacgatg 80820 

taaaaccttg ttcaaagcaa aagctattta 80880 

ccggtcaaca gtttcggata aggctcaagc 80940 

tcattactat acttttttag ttggggtggg 81000 

aattctgaag actttccgac gaaatataga 81060 
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ggactaacta atgtttcttc aacctaccca 

caatttggac agatctaatg tgggagaatt 
tttcatatat aaatgttgta caaaaaattg 
ataaaaagaa aataataata ttataaaatt 
ctaaaaagga gtttaacttt taaagccaaa 
acaattataa ttcctttttc aaaaagtata 
tttactttaa agaagttaaa ttttaaaaag 
gttttgcatg atttgtaaaa tattccattc 
caaaatattg aaacgtggga cacgtgtaac 
aataagcaat aatgtccgac atgcacatgc 
agaagggatc attaactttg gaatcctaat 
taaaactacg gattggtccg atggttagtc 
tttagcacaa aattttgtat ttatgtgtga 
tgataagtgg aattagtttt gggtttagta 
aacatatttt aaagtagaaa aaaatgaaaa 
ttttggtata gactacatgt ttaggattta 
atctaattaa attgaaataa tatcatacta 
gatgtaactt cttcttactt tgatgtttta 
cttcatatca tcaataaaat ttctccatgt 
gtgtctttct ctattgatga atttagatct 
acatgaccaa ctggtttgaa accaaccttt 
gagctcttta attctagagt ctcatggctt 
aaaacaacaa tttttctcat ccctaaaaaa 
ctcatagtcc tttgcaatgt tatatacaaa 
aagcttattc ttccaaaatg tatctctcct 
attttggata atgtccttat agtctctgaa 
ggaaaactat gagaaattgc tttcaaaatt 
tcgaactatc tttttggcat catgggtaag 
atgagaattt gtttcccatc aatccaattc 
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tgttgttcac atgccctgaa tatgatagga 81120 

ataaaaaggg gactgaaaat aagaaaatgt 81180 

tccaataagg aaaataagat gatcgatgca 81240 

attgtttcaa tcaatttgta aataatagaa 813 00 

gatggtaaat gtaattcact ttttaaatta 813 60 

tttttacttt ttttcatcat taattatatt 81420 

tgaaatgtgg tggtaataat ccaaacaaga 81480 

tctgtgatcg gtcatcaaat tggaaaatga 81540 

ggtttctcac tgtatacgta gtatagagac 81600 

ccaagcctgt ccctatcctt tttttttttc 81660 

tcctaaaaat gagatacaga tatacgtttt 81720 

atattgaatt caccttatgg agaagatata 81780 

aaatcttcca gataagtgaa agatacaata 81840 

gactgaagga tacccttgac aatatcaaaa 81900 

gaatttaaaa aataataaaa gtaatgtatt 81960 

atatgcatat tcagaaatta aacttgagat 82020 

gttgatctaa gcgctcataa taataaacgt 82080 

ttcaaagcta aaaataattt taataattgt 82140 

gtttcagcta atgataatga ctccctctta 82200 

acactctttg aaatggattc agataaatac 822 60 

ttaaagaaat tttggcatct ttgtggatct 82320 

gaaactagcc aattcccacc tcaggtaaat 823 80 

tctaatccca ccaccatgaa agatttttga 82440 

attatgtcca aagtcctagc caatggcctt 82500 

aaacattctg cttttatcca agatcgttcc 82560 

attattcacc acataagatg caaacaaaaa 82 62 0 

gatatcagca aaaaaattta ttgagttgac 82 680 

atgggattta acaaaaaatg gataaactga 82740 

taggtggttg ttaatgggga taatgtgggg 82800 
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gaaatttttc gagggagaag acgaaagata 

tatttgcact ggggtctttc aactattctt 
tggtgtgaag gtgtgcaaag ggactcccct 
gttttctttt ttgtcggcct aaaaataatg 
cttatggaaa agtatcaaga caacttatca 
cgattattca acaacaacaa agacacaaaa 
atccaacaaa tatttaggac ttccttcaat 
ttcatcaaag atcgtctttg gaattggctc 
gcgggtaaag aaatcctatc aaatatgctg 
cttttttgct ccattccact ttaaaagatg 
tggggcaata acaacaaagg aaatgaatca 
aaaagaattt ggaggaatga aattaagaca 
aaagcaaggt tgaaaattca ctaccaacca 
taaatatttt tcatcaatgg atttcttgga 
atgatttagt attcacgctt cacaggtaat 
aatggtgatt ctactcatgt ctggactcaa 
tgtctcaacc accccaataa tggtatctct 
gctaattgcc tttgaaggaa tgatattcta 
aaaattcgtt ccctaacact attgaatatc 
tcaacgaatg gtgagtacac tgttaagagc 
gacaatagcc acctcaaggt taaagggaat 
caaagaataa agcacttcac ttgaaggctt 
ctgcgcaaga aaaatatcct ctatccctta 
aatgagtggc atctcttcct tgcttgtgat 
ttatggaatc tcgctgagca tagatgggac 
gattttctat aggaaagcca tcttttatgg 
aatggaagca tgcccgtgcc ttgcacaacc 
cagcatcata aattcctcct caccaaagac 
tcaaggattc tcaaaccttt ggcataggaa 
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gaatgactca ctaccatcct acctattcat 82860 

aaaacacggg gaggtgagag gtgacataca 82920 

cccccctctc acttattatt gctaatgatt 82980 

aaattttcac cctttaggag atcctagaca 83 040 

attttcaaaa gtctgaaatc tttttgagtt 83100 

tctcttccta ggtgtcacta aaagcattgg 83160 

cattggcaag agggaaaagc tatttttggg 83220 

aatcattgga ctagtaaaca actctcaaaa 832 80 

ctcaatctat acccttttgc tgcatgagtg 83340 

attttgaaga aaatgatgaa gtgtttttgg 83400 

attggctcag ttggataaaa ttattataaa 83460 

cttgtacgcc tttaatctta ccatgttagg 83520 

agatactata atgttcaaaa tcttcaacgc 83580 

tgtccacctt gggcacaatc ccatttttgt 83 640 

ggtaaagcaa ggtgttattg gaggattggc 83700 

ccttggctct aacaataatc ttaatccttt 83760 

cctcttaaag tttctgacct tattgatcat 83 82 0 

gattcaaact tcaaccaaga ggatgctcaa 83880 

acggagagag atcaactaag ccggaaatat 83940 

gcatatcatg ccattatgga tcatttgcta 84000 

tggaagacca tttggaatct tgttgtgcct 84060 

cttcgaaaat gcattccaac aagatccaac 84120 

cattgtgttt gctgtgattc aaatattgaa 84180 

catgccagac tcatttggca agctgctgac 84240 

aatgcagata atttccatga ttttatgttt 843 00 

ccctctctct tgctatgcag catttggcag 843 60 

aataacgtcc acctacaact cataatggca 84420 

atatcaaatg taatttggat gctgccctac 84480 

tgtgcttgcg ggattctcag ggaaatttta 84540 
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tcaaagcaaa atctgctaca attcaagaac 

ctctacacca gtccatcaat tggataaaaa 
ttggattaca agacggtggt tgataatttg 
tttacaaaga tgtagagcct ccatctcttc 
tataaagaga cgaggaaatc aagcaactca 
tagctttggt gtcttcaatt acatcccaac 
gtaataacct atgaagtacc gataccaaca 
gtaatgttaa aaatatagaa cgtagtatgg 
acactagaca cgacaagagg ttagaatatt 
atttaaaaaa aacatgatat ataaaatatg 
ttcagttaca tcccaacttg tattatttct 
ttttcatttt aaaaaaacat gatatataaa 
tattttattt ttgtcaaact actttcttag 
gaaagtaaaa taatagtgag aaataataaa 
acatccggtc aaaacaaaaa gatgaaacaa 
gctagaataa aaaaaaagtt acatgtatga 
cacaaatttt caatttgatt cttacagtaa 
tatcaattta attcttatca tcatatgtca 
ttaatttagt ttttataata tagagcattt 
ataaataatt gtatcgctat aatttattct 
ttagaataca ctattcccat ataaaatcat 
tatgtctctt aattaaataa ttcaagattt 
aaagataaaa atatacaaat tttgagtagt 
tttaaacgca aaaaatgtac accttatcgt 
aattacgtgt cattgtaagt cggtgatgaa 
acgtggctat tttcgtaaaa ctccaatcca 
gactcgatca caaatccgtt atttttttag 
taagtaaata ataagtaaaa tggacatgag 
atactaaaac attatcattt taagaaagaa 
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atccaaaacc tgaagttgca gaggcatggg 84600 

gatctacaac tccagaacgt gatttttgag 84660 

atcagttgtt ctacaggaaa atcagactta 84720 

tatcccaaac tcaaactcaa gggtgagtgt 84780 

taacttagca agagtttctc gattttatgt 84840 

ttgtattgtt tctcatgtta tgaatgaaat 84900 

cggacaccgg acacggacac gtgaacacct 84960 

gtgttggaca ctggcacgga cgcgtgtcga 85020 

catgcttcat agataataac tatgtttttc 85080 

ataaaaaata aatataggtt tgatagtgtc 85140 

catattatga atgaaatgca ataactatat 85200 

tgtgataaaa aaataaatat aagtttgata 85260 

tgtacaggat aaaacaacag aaaaatacaa 85320 

ataatggaag atatataata gatcatacaa 85380 

aagagagaaa aaaaatgtaa taaaataaaa 85440 

atatcaacat cctttggcct taatacgtgt 85500 

aaaaattact aaattttatt tgccaaaaat 85560 

atcaataacc attgatctcc ttacctattc 85620 

ctaattgtat cgctatatat atatatatat 85680 

taatttggta cccgtaaata gaaaccaaat 85740 

aaaaaacaat tcacacaagt gataaaaatt 85800 

aaatcctaat tcacctttta agtatgaaat 85860 

ctactagtcg tacaaacact aaattatata 85920 

tatattatag ccttgattta taacaataat 85980 

tgatgatata ccatatatgc aacattatac 86040 

aataaaaatc cgtgttctga atcagtttct 86100 

gaaccacaac tccgatttat cccataacaa 86160 

ttttagcaaa aaaaaataga aaaactaagc 86220 

agaaaaaaga ctatcataac tgattaggat 86280 
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cataaaattt cctgaaactc caaatttctt ctttgtgcaa aattatcaat ataatcctct 86340 

ttgttaaaat caacagcgtt atgggatgct taattatcaa ctcaagactc agttaatgac 86400 

gttaaagcca agaagattca cttccattat gcgggacaac aaatttaatt taacagaagc 86460 

aacatcatca taactaaaaa aaaaaaaaac taaactgatg agaaagagaa tgaaaacaaa 86520 

actacgacct agaacaaaca caggaccaaa catatatttt tcctcactct aaacaatatg 86580 

cttttaaaac ggggcaactg tacgaatgaa attctaccgc tagaattctt tgattagtaa 86640 

agtaatccaa atcaaaacca aaatttaaaa caaatgggac aagcaatgaa taaaacctga 86700 

ttacccaagc agaatcttat caaggaaaag aaatgatgta cctgtaacct gttactggtc 86760 

attaaaagaa tcctcttcca ttttctttac ataagtagac cgtgtcagac attcacaaat 86820 

ttccacatat cagcggttca gcacaagcgt acaactctag ccacaatgaa ttgaatggct 86880 

attacacaat tatcaattaa ctgcgggtct tcatgaaaag gaccagctta ccaaaacatg 86940 

gggagggcca atgtatatct acttccattc aagtcatcaa gtagatgact gatgactctt 87000 

tcaaactcgg actcatagta ggcctaatgt tttcaactgt tgaatatgct cgggtggaag 87060 

agccggggga gaccaatccg atcctgtcaa ctctccatca acatcttcaa ccacatactc 8712 0 

ctatgatgag aagcaataat agtttcacaa tcaacaattc aagacgacgt cagaattttt 87180 

cagggcatca tgatgttctc ataaaacatg cagttaacac atgggattat cagtactgat 87240 

aactaatctc aaactgcttg tgttccatgt caaatgtaat ccagaaagag gccgcccaaa 873 00 

agccagcctt aaggcatcaa aataccagac tcaagtctaa gctcaattac tatcacaccc 873 60 

ggataagcga acgagagttg caaaatcaaa tctaaaacac ttgtcatatc agacagacaa 87420 

aaagactaag gagagagaat acatccaaca aaactcttca aataagttga tgcagacatt 87480 

gttagcaatg aataaaaaac tcagatggta aacttcacta acaaattgaa tgcaactcca 87540 

aacggtgaag acccatcaaa tccaaatctc gaaactgtgg gtcaaatcac atcctctcca 87600 

gccaatcttg tctgcagcca aatcataacc gttgattggt tattgcacat taattcatgc 87660 

agagagaaaa tattgataaa tggtccagat tttggctgca cagaacatcc tgctgcagag 87720 

gcaagagagg atcctgatcc acagtgacat atctgacaac cgtatggcag cctttctgtt 877 80 

acatcagtgc ttcctagcac cacagcccca ccctagctaa attatacagg caatggaaga 87840 

gcaggagggg attgcataat gcatactcat gtacttggga tgggtgtgtg agtgaggata 87900 

aacttgagga attgatttgt aacacagctg cacatgcagc aggtgggacg aaggagtttt 87960 

gctttgagaa cagaatcagg ggaaggggga gggaggagga agtgaggcaa gcatgtactg 88020 
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agcaagagac agagaagaag aaaagtagac ccatcaaaga gtctgacata tttctgtatt 88080 

tacttctttt ggctgagctt ttatacaatt aaagatcctc atctatataa tggtatcata 88140 

ttcatatcag tagcaaagat gcatattatt acgagagttt ggaagtactt tccacattat 88200 

aattccagtt ctctgccaaa tgtcaataat ttattatact agaaaatcct tttgaagtca 88260 

atcaaaacag ctacatcaaa agcaatgcat gttatataat ggtaagctag tagctgaaat 88320 

acatttactt tcgcaaaacc atcaggcatc ataaaaatcc tcagtcaata tggctctaat 883 80 

gcaagagaat ctaaaaaaca agtcaattac aaaagcagaa aaaattatgg ttgagattct 88440 

tggctattcc ttccctggta aatgtttctg tgaagatatt atataggact taaatgctaa 88500 

aaaaaaccca agtgatggct aaatgggcta attgcatata tcagttagga atttagcatg 88560 

acttgacagc tgaattgcaa caagaattga cagctcatag caatgcacca aaagcacaaa 88620 

agatgagaaa gtaaaccaac cacacctttg ttaacatcct ttttgcaagt atgtgataat 88680 

gacgttgcca tatccgttga cctaccatgg tagcaaccaa tacactgtag aatatcccaa 88740 

ttacagtaaa aagtcccaga acaatcaaag ccataatgaa taataatggt agccctgctt 88800 

ccccagcacc ccccaaacaa ccaccacatt cagttgccat ggttccacaa ctctcaaagc 88860 

atgtggtgca atcagtccac atacaaagag tgcctggcaa atgacagtct gcacaaactc 8892 0 

tgcaacaaca tacaaaacaa aaatcaaact tagtcacgaa tcactgccat tctgagtcac 88980 

aaacttgctg ctaatataac ctaattcatt aataacacga cacagccttc tttctctgtt 89040 

tagcataagc aagcatgaat aatttaacaa cggaatatta tgtctttcaa aaaaaagtta 89100 

ttaaaagaag ggtctgtgat atagtgtaaa ttaaatcttt atctttatat attacactcc 89160 

aagcatagtt tctggcatcc tacaattaag caagaaaaca agcattataa cgatgcctca 8922 0 

tgcagatgct cggagtttca tcatgtgttc tgacaatatc agaggcttgt gaatttgtaa 892 80 

aatcaaggtt attacccagg ttgacagcaa caaagacata gttctctaca aggctgagct 89340 

aaatcattgc gtactcttcg atcatagcag gtaatgaagc acccagaaag gccaagcaag 89400 

gcgaaaaaca atagagctcc tgcattaaaa aataccaaca aaacatcaac acaaaaacca 89460 

gtatcttttt aaatgtggaa caaggaaact atgaactcaa ggtgatggag gaaaaaaaga 89520 

atttgcagaa aataatgttg tataagcaat ttgatgacac cgtttaagct aaacgcaatt 89580 

taccacatat atagtaaaag ctcatttcac tatcaaaacc ccagagaaga cgaagccaat 89640 

attgctggta accatctatt agataaacca aatatgccag tgaagcaatg acctgcatat 89700 

aaaggtaatt aaggggataa attggatcac ctatatgcat tcataaagta aaaatggata 89760 
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agtagatgga aagattaatg gaaacatatc acctacaagc tggacagaca gaaaaataaa 89820 

caaaatatct ctggtgacaa aaaaccgaaa tttcagggta cgccatttcc tatcagcggc 89880 

aacatgaaca cgcaaatgat aaggagcctt gcacgtagtg cagtgagcaa aggcaaaccc 89940 

ttcctgcaat aaaaaattaa gtattatcca tatatttctt aggtcaagca ccgcaaagta 90000 

catcatttta cacattgtta tttaatatgg aatatccagg tgtttctcca tcaatggtta 90060 

tttactcagt accttacacc ccacaatgca ccatatacat tagaggattt tttttccaaa 90120 

gaaaaattag cttgttgtct acagttcttt tcatcactag actaagtcac aacaattcat 90180 

tttaaagata attaattgtt acagtatcat actatcatta aaaaaaaagg aagaatagaa 90240 

tggaagacta aagatagtta ctaaacaaaa atgtgtcaat ggcaagagat acaagagcaa 90300 

ggaaaacatg ttaaaagcca tggagagtaa atcaaaactg caaacacaaa ttgcctaaaa 90360 

taaaattgac aaacatgtaa actcttttca tcaccgaata gggtaatggg tagccaagag 90420 

agagagataa ctgaagggta agaatcagca gaaatcatca gaaaggatat gaaatgttac 90480 

caagtcaagt acaggctacc aggggtggat caagagatgt atgaaagggg ggccaaattt 90540 

tttttataca attatttaaa tatttaactt aataaatgat tatttaataa attatgaatt 90600 

tcacaaaatt atttattatc aatttcatgc gtaaaaccag agatatgatt caatacaaag 90660 

gaaattaagt aaacactaac tcaacattct aaattttctt tacttttctc tattaatatt 90720 

tttttaaaaa gaaaattctt ctcacatata caaagagtct tgagggaggg agggagggca 90780 

atgccccccc ttgaatccgt taccacaggg taatagacac aggttagaaa aaatatcatg 90840 

agatcaacaa atgaaaacat aacatagatc atttgattaa ttgattaacc aaacaaaagg 90900 

tttaccgacc accaaaaaat agccttgggt cagttttgca taaaagatac agcttgccat 90960 

ccttactatc aatcacgaat catcatcact tataaagttg tgttgttatg taataaatca 91020 

ctaatcatta ccttaattgc tcgccaatga tccaaacatt ctcgatgcac atattttgat 91080 

gtacctttgc acttacaagg ggctatgaaa tctctacctg ggtaaaaaaa aacaaggaag 91140 

ccaactgaaa aaaaatatga tgacacatcc aagcgttgag gataaacatt caacaacgaa 91200 

gcttacatag ctgcaatacc caacacgtgg aacaataaca gtcccaatag caactataca 91260 

cataattgca aaagaaaacg caattcaaat tccaatcaaa tgatccagac gttatcgaat 91320 

tattcgaata caataattca tttactttgc aaccaaaaat cagctgagct tgaccaataa 913 80 

ccgtgcttct taacttaacg cgattcaagt tgttataaaa tgaattgaat aaaccgaaca 91440 

ggtaaaattc gaaatcacga cttattttat ttagtatata gaattaagaa tagattgttc 91500 
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tcaccacatc gaagtgcagc ttgccctaac 

aaaaagagga tgaaagtgaa attaccgtca 
tcggagggac cggcttcgag gtcgatctcg 
agaggagaag aatctgagtg atcggccatt 
attctaattc ttcatcatca tcgttagctt 
atctttcact gatgatcctt ctctttctct 
tcaagttttt gttatcacag acacaaaagt 
caaaactaac cgcgaataac aatcaaatct 
ttaataaaaa ttaatatttt aacaataatt 
caacttttta tcaatttcaa aagtaaatta 
aaaattagcg tatattattt agttatatat 
aattttttaa cattttaaca attgttgata 
ttaattgtag cttttcactc ttaattataa 
ttacttttct gtgcattact acattctgtc 
catcggtttt tgactattta tttgcattag 
attgaacgac actgatatgc tggcgtccaa 
gtaaagaaca aaaggtatga ttaagccttt 
ttaaattctt aaaaaagact ctttcttctc 
ttgcttgatt tatagataaa aaaatttata 
aaaaataaaa atttatataa atattaataa 
agacatagaa gttgttatgg aagttcttta 
tctccaatac aaaatactca attgtcctaa 
gaaattgaga aaatttacta aaaaaaactg 
ttaagtaaaa aaaaaaactt aaattaacat 
tgaaacaaat aaaaatacat caaaaatatt 
gtttttctaa tcacgatatt ggcatgaaat 
gttggaaagg caataaatac atataagata 
cccaaaatca taattatttt attatatagt 
aaaataaaag tagacatagg caataagtaa 
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ctgaaaagtg aaactgaaaa atttaaaaag 91560 

gtttcgagac aaattcggca ctggatttgt 91620 

gaagggtcag cgaggggaag gggaggaaca 91680 

gaaattcgga gggtacaagt gtcaaaatga 91740 

agattaaatt tagaatagag gagaagaaga 91800 

ctctagaact gccaattgcg aattgatttt 91860 

tgcgagacta cagtgaaacc ggtgaattcc 91920 

gcgttggata ataaaatcta atgaaaaagg 91980 

gtaatcattc tgatttaatc acattcaaaa 92040 

tataagaatt gaatttattt ttttaaaaaa 92100 

ctaaacattc taacaaatta accttccgtt 92160 

aacaaaattt aatttaaaag tattactagc 92220 

ttttatgaaa tttttcatta attttttttt 92280 

ttttattttt aaaatttcac gaattttatt 92340 

ttgactttat taactgctga ggtaaatttc 92400 

ttaaagataa aaatactcga tttttttaaa 92460 

agataatagt aaacgtatag gagatagggt 92520 

actaccatta aagtaactaa atcattcggt 92580 

tataatatta atttttatta ttctaattta 92640 

aacaaaaata ataaaaataa tgtcacaatt 92700 

ttgaagcttc aacttgaatt ctcattataa 92760 

taaaaatttt aaacaacttt tagattaaga 92820 

ataaaaataa ctaaaaataa gtagtatttt 92880 

ttggatcact aaaattatca ataactaagg 92940 

acattgatat gtgtatggtc aaaataattt 93000 

aattcttata aaaaataatg attaatcttt 93 060 

agaaaaaaat ttaatataat ttatttcata 93120 

aaatatacac ccatagacaa taagtaccca 93180 

atttaaaaag ccagtactcc acgagttttt 93240 
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ttttttgttt tttttaccgg aaaaccgaac caaaatatcc cccaaactaa actacaaata 933 00 

ttatcggtta gtaaaactgt aaacgtgcca agattgtatc tttttggtag tttttgtagc 933 50 

atttacttcc aaggaatgaa gaaaagatta ttcgtagaga ttttgggctt cggccatcaa 93420 

ttcaagtttt tgggctcatt accagtccat ttatcatacc tttccaagga gggaacaatt 93480 

attgttattg acactatgct aatagcaatt ttatcattct acattccttt ctttttccta 93 540 

catcctcaac ccttcttcct tcctctctca ctttcccctt cttatttcat aatacaccct 93600 

acccttctcc cttcctctct cacttcttat ttcttcatta ctttttccaa ctcttgtctt 93660 

cactctcgtc tccctcacat cacctaatgt tgtatttttt tttaatacaa cagaatcgtg 93720 

tttttgttgt gttttttaca caatgaaaac atgcatgatt tgcttttttt ttatttttta 93780 

gaattgtaaa ttttttaatt gttctaaaat atacatattg tatagcaagt catttaatga 93 840 

ggtccttaac aagtgaataa tgagacaaaa cataacaatg gaaattattt ttcgacaaaa 93900 

aaattataaa acagaaacat gtttctatcg tgcatgtata taatagaaac atgtttccat 93960 

tttatgaagt gcatcagata aaaatataac aacaaaaaca tattttcatt gtacattaat 94020 

agttatacaa tggacgcatg ttttcatagc gcaattgggg gtggcaaata aaaaaacaca 94080 

acataaacat gtttacattg tgtaactatc aatatacaat gaaagtatgt taccattatg 94140 

catatgggga taaaaaaatt taacaacgaa atcataattc cattgttaaa ttttttttaa 94200 

tcccatttgc aaaacaaaac catgtttcta ttttggttta tgtcaggggg caattttggg 94260 

ataccataaa aaggtactca cactggtagt gccaataaca aaagtaggca gtgataatag 94320 

caattccctt taaaagaaag catggatgtg atggaactct atttagtatt ggggctgcta 94380 

ttctcacccg caattttgca aaattacttg cctgcccctc ctccctttcc tgcaccccac 94440 

catctttcct tcttccttgg ttgcacttga atctgcgatt aggtaggttt tttttttctt 94500 

catactttta actgaactga gttataattt tgttgtattt ccaacacaac agaatcatag 94560 

aattttttat ttataaaaaa acaacaaaat tataattctg ttagcgtgtg tctgggttga 94620 

gatagtaact taattcgaga gattattatt cacgcttatc gtgatatgat gccatcacgt 94680 

catatatcat tcaaacttat atcacgtcgt cattattgtt ttcaattaat catgtgccaa 94740 

taatgctcaa tttaaaattt atttatttaa cgtaaaaaaa tggcaaaaaa taattataca 94800 

cagaaagaaa attaagttac tctaagataa tctattgata acattactac ttttaataat 94860 

tatttttcaa tattaatttt attattttaa ccattaaatt aaggtttaaa tattttttta 94920 

gttcttatta tttaatgttt tttttagtct taaaaaatta tatttatttt atttttagta 94980 
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cttataatat tttaaataat attttgaaca gtaaaaaaat attataaaat atcatatgaa 95040 

tcaaaaataa aataaacata attaataagg actaaaaata aaaaaaaaat ataataataa 95100 

aaagaaaaaa agtactaaat tacagcacta aagaaatatt aaatcttaaa ttaaagttgg 95160 

gttgaaaatt cattaaatta aagttaattt ttagcaaata aaatatacat ttttatttaa 9522 0 

aactaaattt atttaatact tcatttgacc aatttaatat aatataattt aaagtatgtt 95280 

tctcttcatt ataaaattag aaaagtttga tttttgaccc aacgaaactt gttgtatgtt 95340 

ttcaattctt ctaaaattaa aatcttataa tatgtttcaa agttaaggtt gattaaaatt 95400 

gaacatatat gtttaatgaa tataagttca attttacacg tatttagtcc ttaagtggag 95460 

attcatggta tatgtatgtg cattttaaat aattgacctt aactctagat caaattatga 95520 

tatattttaa ttttacaagg attaaaaaca taaaaaaata gtatggacaa aaatcaaact 95580 

cttttaattt tataaagatg aaaaatatat tttactttat aatatgtata tatatatata 95640 

tataatagtt tttttaagaa aagtataata aatattaact ataagttttt ataattatta 95700 

agctaatgtt aatatattca agaattcaaa taattaatat tttacatata ttggctaaaa 95760 

aaaattacat atataattga ttgcatagtt aaaatgagta attgatccaa aaccgatcaa 95820 

aacctttgat tataaagggg aaaaaaaacc ttgttgatgt tgatagggaa acttctcagt 95880 

tctcattgtc aatcagaact tgcaagttgc atttccacaa actgagttga taatttttgc 95940 

tttcaacttt aatccggcca gtttcgtgac cgtgtttgag gagcacagca acaccaaaca 96000 

tacttgcctt tgaggtcttt accctcaagc caatgatagt aactccaaag tcaaaaagtt 96060 

cagtattttt tttttttttc attttttcgt cagcttgaat gacacttgga tttagacaaa 96120 

ggttttttca taataattat aataataata attttcacgt aaggttgagc ataggacgag 96180 

ttaattcaaa caattttatg tcattatttg acctgaaatt ttaataaacc tatttatttg 96240 

agtcaaaacc atccaactac ttgccttgca ccccaataaa aatgagaaaa aaaatttact 96300 

aagaaaaagg tgaggaaaaa gtatcagaaa tcataattaa ataacaaaaa aaagtatatt 96360 

aacaaaaaaa gtataagaat tttatccttg aggtttgaat attttttcta ggtgttttca 96420 

tactttcaaa tcatcttgta agaatctttg catgacttca ggttaaaata atatagtatt 96480 

aataaaaaaa agtataagaa ttttatcctt tttgaaacaa ttaacttgat ccaacttaaa 96540 

ttaattgaag tataaaatac ttttaaatac ggttcatctt ttatttaaag taaaagataa 96600 

cgtataagac tcaacgtgtt gtcaactaac caatgttttt ttcttttaca aaataaaatg 96660 

ttcaagttat cgtatcatat caaatcagga aagataaaag gctcacaatg gattaacata 96720 
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aaaagaaggg gacaagagat aaggtggggt 

tacccattga ggcaaccata taatcagtaa 
gagtaattgc ttatagtgtc ttaatatttc 
atttttataa ttaaaaaaaa tgttttattg 
taaactgtat tattattttt gtattagata 
ctctttatta atattttttc atttaaaata 
ctaattaatt taccaataaa tcaaatgaaa 
ataaatataa ttattttttt atagatcaca 
tacagttaca tactcgagat ttgaattaga 
gtagttgatt tatgcactcg attgtacaaa 
gttttgggga tttcacttac ttaatatgat 
aaaaaaagaa atattattat tattacaatt 
acgaaaaaaa aaaatattga gtttcgttta 
ttaaaaacta gtgtctgcat catcacatta 
taaaatcaaa ggaagattca ctgacgcaag 
cacattttag tttccagcat gcaaaacttt 
gtgtcttgaa caaattatga aagtagttag 
ttcttaatat gcaatgttga tgtcccttac 
acttggggtt gctactgttt gggaagctag 
ttttcagaaa cttggaaaac ttaaatagta 
ttcaaagaaa acataaaata acatgttttg 
tgaccaattg aagacaataa attagtgctt 
tggtatcttg gaatatataa aattaaaaaa 
ttctgtttgg tatgtatttt ttaaaaataa 
aatttaataa attttccata aaaacttttt 
tattttaatt tttactacag taaatatatc 
aaataattaa aataatcgat aaaaatatac 
aaaaattttg aatggtcaga aaaaaaatta 
tcattatttc caaaattaaa aaaaaaaaaa 
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tgtattgaaa cccttgcgca tgtgagaaat 96780 

gtgaactgtt atttatggag aaaaaaagta 96840 

gaaacgtgtc atgtagagta attgttataa 96900 

gttataaaac caaacaaaaa atgattccat 96960 

aatgattatt ctccctaaaa acattaaatg 97020 

aataaataat ctcaacaagt tatagttatt 97080 

ttatatagat attttaatat ttatccaatt 97140 

attatttttt caccctttga gtattcgata 97200 

aattactaat taagttaaac taattttgat 97260 

tataattgtt tttaaatatt gtgcaaatgg 97320 

ttagaaacat tattattatt atttctgtaa 973 80 

tcttcatcct tcttatatac ttttttaaaa 97440 

tgattaagaa aaatgtttca taattaaaaa 97500 

ctaaactagg ttgcctgatg tcactactga 97560 

aggaagtcat catagaagca gctataaaaa 97620 

tgattacgcg agacatatct ccatttataa 97680 

caacttagca tcctcttctt ctttttttct 97740 

tttccttttt cttgtttttc aaaatacaca 97800 

agggagtttg gcaggaaaaa atattttaat 97860 

tttgatttga gtgtcttttt tgttttcatt 97920 

tttaaaattt agaaaatatt ttcaaaaaca 97980 

tcagtttttc ctgaaggggt gaaaatatga 98040 

taaacattca aatcaaatgt tatcttaatt 98100 

cattcttgag aaatgatatt tctaaaaaaa 98160 

cgtgaaaaga ggaggtaatt ttactttttt 98220 

atattttttt attaaattat ataatatgat 98280 

ataattattt aattttaaaa aaaattattt 98340 

ttfcattaaca tgaaatatga gcatgaagat 98400 

aaactcctac aataccaatc gtcggttaaa 98460 
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ttcttccata gcttgagagg ccccaatagc cattcttagt gacaacccac tttggattgt 98520 
catttggaat aaaaccagca ataggaaaag ccaattgggt gtgtttgctt gacgtatcac 98580 
ttgccattat taattattca aaccaacaca aaatagggaa tatgaaaaag aaaatcactc 98640 
ttgtgtcaac cgaccattcc aaatgcaccc accaccatca ttgtagcttt atacacatoa 98700 
ttgacaccga gacctaatag cctgtaatta tttaaactta tccaatcttt ttttaaaact 98760 
atttcttaaa tgtcttcttt cctcaagcac ataatccaat ccatatccaa ttttatctat 98820 
tgcttatcgt tttggccttt tttcttttac aattacaaca atttcgcctt ttttcttatt 98880 
agccttacgc agttggtgac gccattgtac cttgcaatcg cacgaggtga aggtgtccca 98940 
atgtccctca caagtacaat ggtcgcccct tatgcttcat tcgttgtctg tgccttttcc 99000 
gatacacgta tggttccttc ctagcaaata ttttatggat taaataaagg ataatgttct 99060 
tgtaactaag tcgaatctaa ggtttaacaa atttaatcca ataacattta attgagataa 99120 
tcataatttg tcttttcagt tatatttttg ggattcttat gaatatgatc tcctctttaa 99180 
ttttataatg attgtcttta tttttatctc tcaattaatc cttaatatga gtgttttata 99240 
atgacttttg aacgatttta tagtttcgga aaatattata tacaaaaata atttatcatt 993 00 
attaataagg aattgattaa aattggttga cagagattaa atcatgggag aaaaatctta 993 60 
aaagagttga caaaaaaata atgcaaaaca tagtccttca gtcttagtgc acaatgagat 99420 
ttgaagtcat gtaacgatca acaaataaca agaataatat aatcaatcat gtaaaataat 99480 
aataataata atagaaatca agaacaagta tgtccaaaaa atttagggag ggtaatcgaa 99540 
gattcctgac tcaatgtagg ccatattgtc taaagagtgc ttccaaattt ccattgaaat 99600 
taaagctcct ttttaggcac ctagacctcc taaagggtag tgtaggtggc tcggatcttc 99660 
actccacctt gttaaatttt ctttgtggga tctcaacgta cacattcaaa gtgattgaca 99720 
agttttttct tccagaattt tagtaattgt caaaacaata ttgttggtct gataggtcct 99780 
caaataaatg gaccagagaa atttttattt gaatcataat aggtaaattt aaaataatct 99840 
attaatttat ttagtatctg ttaaaaaatt attaaaaaag tttaatctat ccataattta 99900 
tttcataaaa aatataatcc attcattagc attttttacg gatatatatt cagcatatcc 99960 
ataattttat tttaaaaaat taaatattat ttgtataaat tttttgttta attcacttta 100020 
aagaagccat tatttattta ttttttaaga taatttttta aaaataaata tttatttctt 100080 
aaatttaata tattttcatt aatatataag aagatattgt atttttttaa aaaaaaggaa 100140 
tccaaatatg aaaaaatatg ttggatagct ttttatctgt aaatagctta ttagatggtc 100200 
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catgagattt gaaaataaaa gagtataaat 

ttcaataatg aaggctacgt tgggagagtt 
cttgtttgca agatatcatg gttttaaaat 
tctaagtgag aagaaaaatg taggaagaaa 
tggattagaa aagtaaggga gaaagagact 
aaaaaaatag ggagaaagaa accacaaaaa 
tattggaaaa atgtaaatgg ttggaaaata 
aaagacatta atgataaaaa aaataattaa 
aaactaacaa ttaatattta tcaataaaaa 
aattaattat tatgtagtga caatataaaa 
atgatttgta tgactttaag ataattttgt 
tctataattg gataataata taaaattatt 
aattaatatg gttgaaaaca ttgtaacaac 
atatgtaaga tatttacaat tcacaatcct 
cagctattcc taactataaa taacactata 
caatcagtaa ttcgatttaa gtagtgggtt 
tatatatcaa aaaatttaaa attatatatg 
ttatttgatt aagaaaaaat catttatact 
taacaagtcc acaaataata atttaataac 
gcatttttga ttttttatac acgaaacagg 
accccaaatc ggtctggtct gaccggatca 
actaaatcaa tccaatccaa atcgagacaa 
atttaaaatt ttatatactt ttttatatct 
ttttattaat aattaattat tataaaaaaa 
catttacaga agttaaactt gagatttaat 
ctaatcacat gttagtaaaa ttttatacag 
taaaatatac acaattaatt gccaataaaa 
gcatgtttgg atatttgttc aaaataggat 
agaaacttta ttttgcaaaa tgaaggattt 
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tgttattcat cctttcaatt atatgaacca 100260 

aatcataact tctaccatgg gaatgttttt 100320 
cataacttaa agtgtaagaa attaaaagag 100380 
tagattacaa taatatggat aagtttaatt 100440 
gcataaatgt aaaagattta agtttggaga 100500 
tgtaaaaaga tttgagttta gaaaagaaga 100560 
aagtgttaaa atcgcaaata caagaatgta 100620 
agtattgaaa taacaaaaaa attaattaac 100680 
tattaaaata aaaatgttaa aattgaaaaa 100740 
taatttctca ctgtcataca atcataattc 100800 
taaaattaat aaacttatga taaataataa 100860 
ttaaatagtt attgcataat ttttttttac 100920 
ctgtcaatat gagaacatta gaaacttagg 100980 
tattacatga tgcaaactag tatttttaga 101040 
gacttatttc aataactttc aaactcggtc 101100 
actggtccaa tgaactagta attgacttaa 101160 
tatttattat atataactat ataactaaca 101220 
tcaaaataat aattgaagta ttaaagttga 101280 
acaattacac aagtttcaga tttttttaga 101340 
ttttaaaaat aaaatcggct cgtcgggttt 101400 
tccgagacca gatgtatgac cggtccaata 101460 
actggatttt tgaaatatgg tttgaagtgt 101520 
atcatgttat agatataaaa tattgataag 101580 
tgatggtcta tcaatcacaa tttttttttt 101640 
ttaagaaatc taagtttaat tttactcaat 101700 
agtctaataa ttatagataa attagaattt 101760 
tatctatttt tcttctcaag aaaattacga 101820 
acacaaattt caagataaat ttaatgattc 101880 
gagtactttc gtaaacctca ttttagccta 101940 
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aaagtctgtt tgaacaataa aacaaacatt tactacctta ttattgcctt tccttaagta 102000 

gcaagttatt aattccttta ttttgcaagt gaatacgttt catattgcat gttttgactt 102060 
ggactaatgg acttctccat attgaccaaa ctcttaaaaa caatctcgat tgtcttcttg 102120 
cgaaattgat ttccatcgat ttctctctat tttttctttt ctaaattata ttgacaacta 102180 
tatatatata tatatatata tatatattat gtaacaaatt ttatgatgca gttagataac 102240 
aagaatcaaa caataaaaat atataaaaat aagcttaagt aattaatatc cctaaaatta 102300 
aatcagataa attaaaatgg tttgtatatt gcttgattgt tgacttgttg gaaaaaaatt 1023 60 
taaagtgtga ttggctctcc tatcttgtca ctaggaaaaa ttaagcaaca aagagaacaa 102420 
gtaaaagaaa atgggcaaag aaaaaagtgt gaattgttag ttggggaatg ttgggctgtt 102480 
cggctagaac agatattgct agaacaaaaa agagtaccaa agtagatttg gattccgaag 102540 
ttaggaaaga agggttattt taaagtggca ggtaaatgta aacttccaat aaattaaagg 102 600 
aaagggaaag aaccttacta tggaaatgtc tcaccgtgac attgtatgta tttggatgaa 102 660 
ccttcataaa attaattcta aaagaattga ttttataaaa ttaattttga agtgagatag 10272 0 
tttgtgttta aatgttttat tataaaatca agttaaaaat aaaattcagt acaaaaattt 102780 
tatatcaaaa atagaagttg ttgaaagtta ctttaacaaa aaatcaatta tacgtttcaa 102 840 
acataaaatt aaacatctat gtatctaaaa tcaattttgt attcagaatc aattttccaa 102900 
agttaaatca aacatgcact atttttttta gacgttatta tataaatcta tataaccttc 102960 
tgggagtagt ttaaacaaat ctaatctgga ccaaacaagg actcaaatta tcttccgggt 103020 
gtgaagcagt acttcatatc atcaaacaaa tgatagacat cgcatttcct ttacttttat 103080 
tattagtaag aatttctatt ggcacttttt cccttttcac ctctttttaa gataaagttt 103140 
gtaaaagatt gttcaaacca aaatcataat gaaaacttct gtgggtgtgg gtctacaatc 103200 
ctgcaaattc acggaagcaa caacattttt atcactttca actataatat ttgcatatat 103260 
aagggcattt tcgcattacc caaatttctt tccttttttg ttgttgtttt ggttttggtt 103320 
tacgtttggt taatacaagt atagtataca cccgatggat cttgcacaaa aaaataggaa 1033 80 
gtcatatccc ataagaaggg aactgaaaaa tgtgtgatga tcgtcaaact cgccccaaca 103440 
tcatctgaaa ggccaatatg tgacacaaaa taggtctgtg gtttgtggtg attgctaatt 103500 
aaaatatcaa accagaagac aaaagggaca aagaatgaca tgtagataac aacttagcga 103560 
gtcagttttg aacccttttg tctcaatggc gctgcatagc gtagcattaa ttaatgtgga 103620 
ttttaatata tcctacacgc aacatcgatc atatagattc taactggttt gggtatcttt 103 680 
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acatgatttt tttgcgaaat aagattatgc 

ttgtataaga aaatatttat gagggaagtg 
tgggcctgac acaaactctt catttcttat 
tatcatatta catttattaa caaatatttt 
agagggtttt tcttatttat aataagtatc 
ttttgcgtgt gctttagtca agctctctct 
agtcaatttc gttttaatta ttatattgcc 
gcatgtaatt tatgtaaata tttaaataca 
gtaaatttgt atacttacat gcaaagaatg 
attagaatga taactggatt tgagcgtaat 
ctttttattt aatgttatag gtacggaact 
agcccctata cacatataat aaatataatt 
tagtttaaaa atattttaga tatttttaaa 
tcagttcata aaaataattt aaatgtaaaa 
aagacacatc gaaaagaaaa tatattacat 
atacccattt gagtataata taagtatgga 
acattgcttt acgcatagat atgaagttag 
aatggataca ttacacaaat ggctagcctt 
gtaccatttt ctactacatt ttaattactg 
aaacattgac gtaaaaaaac aatacgcatt 
actaatctcg ttagatactt gcttagtgct 
cttttggaat ttggtatatt tatcgtggag 
gatgcatcga atcactagcc tagtagccca 
attaatattc caattcctta gcctatcaaa 
tgccaattct gatcataaca atcaacgcaa 
aataataata agatgggaat gcataggtac 
tttaaatggc atgaaaagca atgcataaaa 
taagacacga aaaaaatatt gaaatgatta 
cgtccccttc tactctaaac actccacacc 
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aaacaaatat agatggagaa gtactatata 103740 

taaggatgtt tccatgaaaa aaagaagatg 103800 
atgattatgt catttaaaaa gaatataaaa 103860 
aacacatatt aaaagtggaa ggattggttg 103920 
atttttatac taaaatgttc tttttttttt 103980 
ctctctcaaa tttcataaaa tttgtgaatt 104040 
cttttcaatt tcttataaat tcattacatt 104100 
ataataaact taattgtata aaaagtaatt 104160 
tattaaataa ttaatgcgaa tttaattagt 104220 
ggatacgcaa tataattact tatacgtaag 104280 
gtagtatagc actcatccag catctatatt 104340 
atagaattaa ctatttatta tatttataat 104400 
taaaataaaa cattttactt ttaatttttt 104460 
aacattttta ttttaataat ttataaaaaa 104520 
gaataaaata ctaataatat gtttggatac 104580 
tttgttttgg aaggctgaat gctaaagtta 104640 
ataaatattt ttcttgattg cggatattcg 104700 
actaatgatt tgacatatta ttattaagca 104760 
aaagaatgat aaatgttgta agatatcaat 104820 
taatattatg cttagggtgc acaccattta 104880 
ttagatgctt tgatttatta ggcacaaatg 104940 
gtcaggtcaa tggttattaa agtggagagg 105000 
atagaagtaa tcggcatttc gaaggtctcc 105060 
catctaatct aacatagaga cgtaaggatc 105120 
gtattcgatt tcaataataa taataataat 105180 
aaatgttcgt attaatttgt taactaataa 105240 
ccagccaaat gttaggctgt gtgtataatg 105300 
ggttttgtcc aatctcgtgt gaaggtatga 105360 
accctcactt aatcatataa aactcataat 105420 
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ttcttgactt tctaatattg ttggtggtgc 

tcatttgtaa gtaatttcgt ggatggtgac 
aagaactata gattaccaca aataattaaa 
atggtatgct acattattac acataatata 
caacatgtga ggtgagtgtg agtgtggtaa 
aacaaaatgt aacaatgccg gcgttgatat 
cagtgacaga gtcgtggtct acctggtgcg 
actgcgacac ttggtatgtg aaatatattt 
ttaacaacaa aaaaacaagt tccactctcc 
taatacgttt ttattttttt atttctatgt 
cccatttaca aatggtgggc tacgtttcca 
gaggcagtct ttgaaagcgt cgtcttctga 
gcgttcttct gcggtttgat agatagctag 
ataataagac aacatacata acgttttctc 
cataaggttt ggctttttag ttttttattt 
tcaagggagg aaattggtac agatcaaagt 
acatcaatag atacttagat agatagatgc 
attactggta ctttcttctg gcagttgtta 
gaacaaaaaa aggtaaaagg gggtgctatt 
gaaaaatgat gaggtggttt tgttgcccta 
tgcataggcg ttgtctgtgg cggttgtcat 
aaaacatggt ttaattaaat gaattattcg 
gtctggtttt tttcttgaga tttaggttct 
ctaaggaatt tcactcttcg tctctctcat 
acagtagcaa ttacttgatc tactctgtta 
cacccttcta ttcttagttt ctctctctta 
ttttgaagtg aatgtgtttg gcttgcaatc 
aattcatctt tttttgggtt ttgccagacg 
gcatgaattc agcatccccg cagtttgttt 
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cgtgttagtg cccgcattct atttaagtta 105480 

accaataata acttatacta acaactaatt 105540 
taaataaaaa cgaccatgaa atattaaatg 105600 
taatactata tgatataaat gtagccttgt 105660 
atctttagta ctatttgatt cttacaaata 105720 
attccggtgg cgttgagaca gtgaagggaa 105780 
cacattagcc atagtggaaa catgacgatg 105840 
gactctctgt agctcaacac gaccaagcat 105900 
aataacaaga cacaaaagtc tcctctattc 105960 
atttgtatat acataaataa ataatgtaac 106020 
ttttttatta tataataaaa attgaagtta 106080 
accaacacaa tgatggaatg gaagggtctt 106140 
ccatgttggc atgaagattt gagcgggaaa 106200 
aaagtggttt ttttatttta ttttttttat 106260 
ttcagttttt gtggataatt ctattaactg 106320 
ttggttgtac tttgccaact aaatattatg 106380 
ctttatttat ctttgttttg aactactatg 106440 
cttaaaactt ttgttgtctt tggaaaaagg 106500 
taggatttgg ttttgtgttc ctatgtttgc 106560 
ttaatgggaa aatcgtacta tagattctgt 106620 
acaaacgtgg tcggggtgga ggaagaaact 106680 
tttattttgt gcttctcttt ggcttgtgat 106740 
gtttgtttat cgtggattgt tagatattaa 106800 
agtacttctt ctggtgccaa gaaggttgga 106860 
cctttttctg tctttgcttc ttgctcctgc 106920 
tttttccctt tagttgtttc tgttattaat 106980 
ttcaatgcat tttaagattg attgagcttt 107040 
gcagtttgcc tgcactgatc agttcttaga 107100 
cctcaagaag gatgagtgtg tacgatccga 107160 
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tccaccagat tagtatgtgg ggagaaggct tcaaaagtaa tggaaattta agtgcagcca 107220 

tgcctctaat tgatgagaca gacatgaagt ttgatagtca ggtgcaagtc tatgaaatcc 107280 
gtgctttgtt tcattggcaa ataaaaatgc ctattctgcc tactaaccaa attcgtttgt 107340 
gctgtttaat acccagtcag aggatgcttc tcatggtata ctgggagagc ctaataagta 107400 
tgaccaagaa gctaacaaac ctactgataa ggtaagagca tgttatgcaa actcgtttcc 107460 
tcttgctcta tctcttcccc aaacaaaaga aaagaaagaa ggcttatttc ttggtagtgt 10752 0 
tttgttaatt acaacaacat atatttcttt accactgaga gcggatgtaa attttgtctc 107580 
atacccacat ggtttctaat actttctgtc agtgggtttt ccctgacttc cattagaaaa 107640 
aaaaaactat atatacttgg gaactcttga atgcttcctg caactgaatt gtcagctcaa 107700 
cttacctgta aatgtagata tagataattt tgagcattac aatttcttgt ttctttcatt 107760 
ggggcatttg tctggcatca aaatgttttt tttttcatta aatatgttgt cattttgaag 107820 
cttcaagtct tattctgtag ggtgttgttg tataatgtaa cgaagtaatc aatccattta 107880 
gaattatgga cagagttatt atttggaatg ttttggcatg gaaaatcaca gccttttcac 107940 
ttttctttcc cctccatctt gagctttatg tatacaatta gggtgacaat gacaagtatg 108000 
ttagtcctgc cattttggac tttgaggaat ctgtatttaa tttggatgta tggcttggat 108060 
tcatactctc ttttatgcct atatatggtg gaaacaatta gggatctaat atggtcaaac 108120 
atgcagatac aaagacgtct tgcacaaaat cgagaggctg ctcgtaagag tcggttgagg 108180 
aaaaaggttt gaatgaatat acgttgatac attgtcagac atgcaattag tgtttagata 108240 
tactaatgtt catgtggttg ctgatattgg cttacaggcc tatgtgcagc agttagaatc 1083 00 
aagtcgtttg aagttgatgc agttggagca agagcttgaa cgtgcaagac agcaggtatg 1083 60 
gagaagatga gttaatatac tagcagtttc tcttgtggat cacattatgt tgatcagcat 108420 
cttccttcct ttccctctct ctttaatatc ttaaggcaac aaaactgcag ggaatttata 108480 
taggcggtgg gttggattct aatcatctgg gttttgctgg atcagtaaat tcaggttggt 108540 
ttgttattat gctactgctg ctatatattt gcattgcact tggagttttt gtgtcttgtt 108600 
attccatgtc ttggtaactt gatgttttcg ttttttaaaa ttttcttctg tggtgaaatc 108660 
agtttttcca aatgtgctta aatctcttac tattggacat ctgaagacta aagtcttctt 108720 
tgtgtctgcc atctcaaaat attcagaaac attttctcta gtcctgtgcc aaccttttgg 108780 
tttcaaatca taatcagttt tgtaatagaa tccattattt ttcttccaaa tatccagaaa 108840 
gcggcagaat gtttaaagcc ttgatatatg ccagttggag aattatttat atgtatcttt 108900 
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catgataccc gctgccaagc tagattatca 
tctttgttgg atgttgtaca aaaccataat 
gtaccataaa catatcttga tagtgtctta 
cttattggca acacaagtat tgaattgttt 
gttagtttgt tttgattgat gtaatattgt 
tgtgttatac tctattttcc aatttgttct 
ttacaacctt tgagatggag tatggacatt 
aactgagaaa tgctttaaat gctcatattg 
gtatgatgag ccactatgcc gaaatgtttc 
tcttctatgt tatgtccggc atgtggaaaa 
gaggctttcg cccctctgag cttctaaagg 
ttaaaaatgt tattcataga aatgtagaat 
aggactttct cattattatt ggattttaat 
agagagatgg gggaaatttg aaactgcatt 
gatttactca aatgcttttt gtacttaaaa 
agagagctgg gtccagcaag aggtgatatt 
aaagaattgc atcctcacac ctgaatttca 
agattaattt gagattattt ctattgccat 
tgtagtatgg tgcttgctga caactttatt 
acccctgacg gagcaacaac gtttggatat 
agaagatgcc ctttcacaag gtatggacaa 
agctgggcag ttcatggaag gaacatatat 
gaaagctctg gtgagctttg tgaaccaggt 
tctcaattta gtaaaaaatt ccttttcaca 
gccagttgca atttttatcc agatgttctg 
atatactttc taatccacat gtgcttattt 
tagattcaaa atagaaaagc aacagtttag 
atgaaatagg aacagatgtt tggttatttt 
gttgaagtta tttgaagcta ttgatcttgt 
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ggtctcatac attaaaattg ctattccttt 108960 
taaactgttg gtaagttgag aaaatattgt 109020 
tattagcccc cagaaacaac ttttgggaat 109080 
agaatttttc atggtgccat gtaagcttaa 109140 
tcttggtttt ccatgagtat cttttcttct 109200 
tgcattataa taataccata tttgcaggaa 109260 
gggtgaatga gcaaaataga caaatcacag 109320 
gtgacgtaga gctcaggatt ttagtagatg 1093 80 
gcatgaaatc tgctgctgca aaagcagatg 109440 
caacagctga aagatttttc ctatggattg 109500 
tgatgcctca tatttaataa acgaaagtag 109560 
gtgtagatct aaattcagtc attgtctttg 109620 
caataacttc ttctactgtc ttcttgagag 109680 
catttattac agttttaata cattatgcaa 109740 
tggctatcac tcagtgtcag tgatagtccc 109800 
ttagatttag tcttgacttc ttgctttggt 109 860 
aattttaaaa aagggattta ttgagatact 109920 
attcaactat ggcatgctgt acgcttgtaa 109980 
ttaagtgcag gttcttggcc ccctgattga 110040 
ctataacctt ggacaatcat gtcaacaagc 110100 
actccggcaa acacttgctg atagtgtagc 110160 
tccacagatg acttctgcaa tggacaagtt 110220 
agatatcctg ttttcttgat agacattagt 110280 
acttgccata tcaatcaatg gattcctcct 110340 
tgaaaaaaaa tcatgtatgt gttctggtaa 110400 
gtgtgtctgt ctctctgtct ctactcacta 110460 
tgtgctttaa atatcatgat tctatgtcat 11052 0 
aaatgtatag ttttttgggt taagggtatt 110580 
aagtcaacaa atttttcata tctaaagtag 110640 
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ttcttataca tcattctata aattgttttg tagtgcaatt gattttaaat ttactttttg 110700 

gatgtgtaaa acccataatt tgatttgaag tagatgaaaa atatttcata aaataaattc 110760 
cattcctaaa ttcttagcac agatactgaa actgaaatat ttttcattct aaagaatcct 110820 
cagtagaaat ggaaaacaac aatttaatca atcaattcaa agttttaaaa cataattagt 110880 
ggtttatgat ctattttaag catcctctta cgatttgggg gcaagtcatt gccgtaggca 110940 
tttaatttca tctattatat tagctgtgac ttgtggttca cttcaaaacc tttacccatt 111000 
catcttggtc actgatagtg cttgctagcc agatttttat ttagcgttac caacttccca 111060 
tttagcattt aggtagattc acttaggatg cctatatcgt gtaaatgcta aggtcataaa 111120 
gtacagtgat ggttaatgaa tgatacatta acagaatcat ttttgaagat cattaactgc 111180 
tatatacact acaaatttac aacttcaaaa agttatagct tggctaatga ctattagaga 111240 
tttaaccatg ctctgtgtga aacaggctga tcatcttcgg caggaaactt tgcagcaaat 1113 00 
gtctcgaata cttacaatcc gtcaggcagc tcgatgcttg cttgctttgg gggaatattt 1113 60 
ccaacgcctg agggctttga gctcactgtg gtctaataga cctcgtgaac ctgcttagtc 111420 
tcacaacaag gttacaagag cgcaaactca taacatgaac gtgatgtgat gatgaatatg 111480 
ggtcagtttc tttggccaag agcatcaaaa ccatgtatca tgtgaggtcc aggttgctca 111540 
catttttcct gcagttttgc gttatggaga agttgtaaat atgactgcta gctcatttta 111600 
tgcatacgag tttgtaaata aattaatttt agacgcatgg tgtatatcgg attgttgcaa 111660 
aaatgaaata tttagaatct gccatgagtc tgtgtggaag ttggttgtgt agaattcgta 111720 
taatgttatg ttgtgtgttt ttgaaaatga atcgactcta gactctctta gtggacataa 111780 
tttatttttt gtgattgcta agggtagctt gttaaaaatc tgtaagttgt tattagtttg 111840 
ttagaaagtt gttaactcta ataagctcta taaatagagc attcgtagtc ttattgtatc 111900 
ttttattcaa tcaactgcaa catgatttta tatgatactc ttcttctcta ctctgatctt 111960 
tatggtgtat ggccacaata tgttggtgtt acttgtctct ttcctcgaag ttctgatagt 112020 
gatagtgctc aacaatattt atcataaatt tcatttttct tggatggata cttgagattc 112080 
tgtcctagat gctgagtgag aaaatattaa tgtgctgaag gttttacgtt ttcatttgct 112140 
tttctatata tatacttata ttggtaaacc aaacatacac accaatccac ttattcacat 112200 
tcaatttata ttgtcaattg tttacaggat tttcactatg caatgggatg tgcagctatc 112260 
aggatttgtt tccttatagc aaattggttt gatgatccac ttgtagctag caggatatat 112320 
aatcatcttg attccagaca gaatctcaca acttagatat agcacagtcc aaagcatcta 112380 
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caaaaaatct gcacatcagg aagccagtca gtgaacttca acaaagaatc tgaactccac 112440 

aacttggttg aagagagcca ggacagaatt atttatatac ctgcatcatc tttggtgaaa 112500 
cacattggtg gcttgatcct aaaaacattt ccatgcagcc ctcctttccc aactagaata 112560 
ccgagctcta aaatttatag gagtaaacag ttaccatgtt ttgtagaaat tttggaaaag 112620 
aaagcttcat atgcaatttt ttttattcaa ccaaatttct taccgctgaa tctttcatgc 112680 
aaagctgttg tttcagcttt tgcaggtggc ttctattttc tatcggtgac aaattctatc 112740 
cccaccatta agcctcttcc tcttacatct ccaatgactg gccaaatata taaaaataat 112800 
caacaaattg tggcaaatgt atgctctcta taatgaagca tcatatagat acctaagaaa 112860 
aggcaaatgt atcgtaaaat aaaaatatat cacaaaatga aaacatgaat atttctggga 112920 
tataagattc aagataacca tactgtcatg tacttgcatc atagatctta aacgctgaat 112980 
cgagtgagaa ccaacatcag cacaatgaca ttgacgcctc tccttgtcaa gaactctgag 113 040 
cacagcaagt cccccagcaa aacatacagg gttctctcca aaagtattaa attgaagctt 113100 
ctgggccatc acgcttgcta tttctggagt tgtaactaca gctgctaatg gcaaaccatt 113160 
gccaataccc tgatgcatgt taacatgagg tacaaatcat acaatggaat tctagaaaag 113220 
gtacatgaga ggcacttcaa caaaattggc cctatctgaa ggtaatgcta gaacaaaata 1132 80 
catagtatag tgaaggaagg caaacaaaaa aaaggaagct aaaaccgata aaaggataaa 113340 
cattgaagtt atgcttaaga gttatacagt gagagtgcaa caagtgagaa gaattcattt 113400 
aatattcaac tatataatgt ttaagattgc tttacttaaa cctatattta taattgtafcc 113460 
atcaaattag ttcatctaat gggaacatgc tctcatcgta catgcttcct atatgcataa 113520 
ataataatga acctcacaac agactgatgc tgcacttcaa gtgcattatt aaggtgccat 113580 
tgtatcctgc aactagttaa aacttagaaa gaagagatga gggtctagac ctttgccatg 113 640 
gtaactatat caggaatgac accctgtgtc tcaaatcccc aaaaatggct tcctgtgcgc 113700 
gcaaacccac attgcacttc atcagctatg cagactcctc cagctttgtg tataatgtca 113760 
taggctagtt taaggtaccc gggtgccagt tcaactgctc ctcctgtccc tagaagaagc 113820 
catgacatga ggaaaaagta caaaggaaaa ccacactatg gtgtggctat aaatattaaa 113880 
taccactctg aagtttagaa attaaacatg aaggatggat tatgcatgcc tgaatagttt 113940 
cagccataaa tccagcaact cttcctgaag ggccatagtc aatatgatct tgcaattctc" 114000 
tggcgtacct attggcatct gagccaaaaa ctccgcggta tggatcagga ttcatgacat 114060 
gatgaacttc tcccttaaag agttaaatgt tgtaaaaaaa aaatacaatt agcacttccc 114120 
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tatctttagt aaaaagttaa ttcatcgttc acaatgatat cgttacaact agagtagcat 114180 
ggattatgtt acaatgatat agtacatgtt tgtgtatgtt tttggataca tatatgcacg 114240 
tgtttgtgtg tgtggacaaa acctctggta ttgggtattt ctacgtgttc atagccgtgc 1143 00 
gtccaattgt actagaactc tctccatgat atgcattcct taaggaaatc atacccatat 1143 60 
taccagtgta taaacgggcc atcagcatcg ccaagtcatt tgcctctgaa ccagaattta 114420 
aaaaataaac aacctgtaat ataaattgct tagttgaaca ccatgttcca tattgaatga 114480 
atgaaatcta gcacactata aaactgaaca ttgcttttca gcaatctcaa atttggcatg 114540 
tcacagtttt acacactgtc atataatcac aaattgatta tttgaattat tctaaaataa 114600 
ttattttaaa agtcaacaaa cttaccatat atgatgaatt atgattgaat gtctaagtaa 114660 
aattttttag tgttagtgtt taacctattt ctcttacatt atttatcatt cttcatagca 114720 
ttaaattggc agagggttta ttaaaatttg taaatattta tgtaaagaga aatagatata 114780 
aaatattaca aaattttcat gggactagat catgtgaaaa attaatgtag gaggatcatt 114840 
cccttgtttg gatatgctaa ctgaaacaca aattatgatg ataaaattga ccttaaggtt 114900 
tccaggcatt ttggatgcca aggcctcggc aaaatcagct attgcatggt gtagatatat 114960 
ggttgttgta tgctgcaaga gtttactttg ctccattata gcattcaaaa ctgcaggatg 115020 
gcaatgtcca caagaaatag taactattcc agcaaaagca tcaagataac gcctgccatt 115080 
ctcatcaaat agatgttgca tcatgccctc tacaatgtta agctatataa acaaaaaaaa 115140 
ttgttaaaaa ttacaaaaat atgtggcact cttttcatat ttaataaatt ttatttataa 115200 
ttttgtaatt ctcaataaat tttaatggag tgtcagacaa aatatattaa agaagatgtg 1152 60 
agctataact cctccttact tatcggtatc gatgatggga ttaacatatg acagttcttg 11532 0 
tggagaccca ttttcaaata acgtaactat ttttacttca tttctttgac tcaattttca 115380 
gagaaaataa aactaagaaa acatagagaa aaggttagca acttatcata atatagtaac 115440 
tgataaaaga ttgattaatt tgggagactc acaagttttt gatagaagtg gaacaaggaa 115500 
ggaccaagga actttttccg tttggcaaaa acttcatcag ccaaaggtcc tttgtatggt 115560 
cgtggcttgt aatcaaatgg tggaagctga agcctccgag caatatcatg atcacttgtt 115620 
gcagttgaag tgctcaaaca agaagaagaa ccaaaaagcc tcttacaaac accaatgctt 115680 
ctcttcacca tcgccatgtc taaaccacct tttactaaac actacctata cctttatctc 115740 
aaactcgatc atccattttg taatgctttg catgccatgt ctcacgaata attgacatgg 115800 
aatcttgtta atttggatta tctaacattc aaaatatatt atgcggtaat tcattttatt 115860 
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caagtaatgt aataaatgat aatataaagt 
attgctttcc atgattgagc tagagcaaaa 
tattgccttt tattccccac ttgtttgctt 
tataatgaaa agaacttaaa tacgatagta 
agtgcaattg aaattgatcg cccaactaca 
atgtacatcg attgagtcac caagagataa 
aattaatcat ttattacttc tatttgtgta 
cttcatcatt tttcatgtaa tttgattggt 
caaagtgatg gaattaacta ctggaatagg 
tacagcaaaa ctacatgaaa acaagagggg 
aaattgtgta gatttttcat tcccattatt 
cgagaatcat gtaaaggaaa acttgaatag 
gtagatactt gagataacca gaagccaaat 
tacagcaagc agtataagaa tcatgcaaag 
tcacaacatc aatgaattat ctcaaaagcc 
cactgccaca agctttccat ttccaatccc 
tagaacaata aattggcaaa tagtcccacc 
tttaagggct tccctgatat cagcaggaag 
ctcataatta accacgttag gtagtagctg 
attagtgatg gacaagttga gataagtatg 
gaataacagt agcaacataa agacaaaatt 
gtttctgggg acaaaaatat ggattcaagt 
atttttcctt ggttttaatg ctgtcattca 
aagtatttat tacagtttat cttgtgacat 
atcaaaatag cgtatgattt tgtctataag 
agtataattt atgaaaacaa tttacaactt 
aattatagaa acaacttata cataatcact 
taattaagtt gtttattagt ttgtgcatac 
aagcaacccc ccaaaaaatt taaatacaat 
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ttaatttttc aagtcatcta ttagatatat 11592 0 

aagaaaaaag tagactgaat aatgtggttt 115980 
ggtgtaatta caccgtccat catcgcattt 116040 
tatgatcata tatatcatat aatgagtata 116100 
accatcgtcc tgatcatggc agacatataa 116160 
atcctctttt attttgattg agtttatagg 11622 0 
atacccactc cagtataaag taattcagac 116280 
gtaggaagct ttcaagggaa gaagagagag 116340 
aaagtgtgta catcaagcaa gctatgcacg 116400 
tagttgtaga gacttgagat aaccagaaga 116460 
taatttcaat acaattagca tttagcagta 116520 
acgagaacaa gcagggtagt agttgtactt 116580 
tgtgtagatt ttttatcccc attaataaaa 116640 
gagaaaactt gcatagatga gatgaagata 116700 
tgcaggaaaa ccaccctttc ttgggtcact 116760 
tttattttgt ctggacggca cagaattgtc 116820 
ggcaaggccc tttaggacat gacccttact 116880 
ttcaaagtgt tcacctaaag tggtccaatt 116940 
cataaaacaa catggcgagt ttctaaatta 117000 
cagtcacaat cacaaaagag aaaaatatca 117060 
cttattactt aaaatgaacc aggagaaaca 117120 
tagaagtttt aaattctgat aagttataac 117180 
ttatatatct ataactgtct ctgtgtttgg 117240 
aagtatttgt gcaagtattt gggaaaactt 117300 
ttgtttttag cttaattcaa tttgttctct 1173 60 
gtgaaaacag tttaatcata tttcctcttg 117420 
taaatggcaa acacttgttc aataagcaca 117480 
cagagcttac caatcagagg cagaaaaaac 117540 
acctgatgat agacccttgg agctgttaca 117600 
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gaagaaaaag gatccattcc tttaacgaag 
atgatgaaag cacctccact tgcacctact 
atggtaggaa atcattgcaa gtaaagaata 
atttcataac cattgcaatc aagatttgtt 
catttaataa aggttaaatt actcagttgg 
gtctctaaac tatttgtttt ttaattaagt 
ttttatggca gcttgaaatg gccagaaaat 
gactcactta aaaaatcaaa tgatcaagaa 
acttgaaatt tctcaaatag ttcagggacc 
ttaaaattct tatatctcta tctcttgatg 
ggatgataat gtggaagagg atagccacat 
attaccttaa gggcaatagt gggtgacatg 
aaattggcag gagctggtgg tggaacatcc 
tccatttcat tgttcagtac tattcctgta 
gcattcacag tgctagtcat ggaaatggca 
cttgtaccat gatcatggat ctgattccac 
tctgtattgc cacaaactta attttagttt 
tatatatata tacaacttag cgcccataca 
ctgttgtcat gttaacacag agtaacacag 
gtttgtgaag attgaattca ttacttcact 
tgaataagcc tagacaatag atagttattc 
ctcagtttct gttagtttag ccaaatgcat 
ttagctaaat tgaaatcagg gctacattta 
tatatatata tatatatata gatagataga 
gatagacatg tacagcataa tcaagttgac 
tatatacaag ttgcagacac ttcctcaatc 
tatatcatga aaccaattgg tcaagaaatt 
cgcaagttat gatgcattaa ttgtcaatat 
aaaagctgca tattgcaagt gtgtgaggta 
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tgattcaaaa gaacttctgc agttccacca 117660 

acagctttta gcttcccatc ctgtataggc 117720 
tgagtacctt gaattaataa tagtgaaagg 117780 
cttttgtatg cttgacaaga gtgattgtgt 117840 
tccctagaat atctgaacat tttcaagtag 117900 
ccctaaacta aaatttggat ccttggatta 117960 
gataatttat gacggtttta aaaccctagt 118020 
cccaattaat aaattttagt ttaaagacct 118080 
aattgactaa tttcaccttt tcttaataaa 118140 
aagctcaact tgagcagtag tgggtgacat 118200 
ttaagaaagt caaatcttga taaaggattg 1182 60 
gatgatagtg gccgctttcc tggcatgatg 118320 
ttggtaacat ttctaggcat ggaaaaatca 1183 80 
cttggtgaaa ggatctttga accaaaatat 118440 
tttctctcag gatctattat acttaaatga 118500 
ctgaaaacac aattcagcac aactgagaat 118560 
agaacttatt tttacttaga tctttatata 118620 
tacaatcatt ttaaacttcc ttcttatcat 118680 
aatgtaatgt gaggctgcta ctatacgctg 118740 
tgacggttaa atgacattgg taggtctaac 118800 
tgttgtaaac agtacatagc tgcatcacct 118860 
gtatatagga caaatatata tatatatata 118920 
aggacaaata tatatatata tatatatata 118980 
tagatagata gatagataga tagatagata 119040 
caaatgccta gaacacctaa agtcaacctt 119100 
aagatatatc tttctatgtt tcaagttgac 119160 
gatatctttc aactctagct tatggagatg 11922 0 
tttctgcatt ttattcttgt cattcacacc 119280 
gaagacaaac atggaatcta aagaacctca 119340 
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ttaacttgca caggcatgaa gctacgtgaa 

ttacctgctg ccgtaatgac tggggccaaa 
cacgttagca aa'cctatgag aaagcatatc 
agggtcacca agattcattc tcacggcaaa 
gaaaccaagg gcaccagaaa ggccagaagg 
ctgtaacaaa agacagcatt ggctaaatcc 
agaagaaaaa aaattattaa caagaacaca 
tcattggatg cccacctgaa gggggaggca 
tagatatggg ttccttctgt ttaacagtgt 
ctccagcatt ctggacatct ttaactaaat 
ttggaccaga ctctgatatt gttctgagag 
cacctatctt taagagtttt ccatttgggg 
atatatctga ctctgtttct tccatctgct 
tacgtgctag aatctcagct ggctttacaa 
tccacgcctc atgaaggcca gcaagttctc 
ccttcaaagt agtatttcca gcatacatat 
acaccttatc aattgtttat aattacttta 
tttaaaattt agttttgtta acatgggtgc 
ttattaagaa agaaaacatt tatcacactt 
ctaataaaat tagtctcaga aactattcct 
catcttccaa tatgtgttct agcatacaca 
aatcaagggg tatggttatt taccaaccat 
aagtacttat ttacttctgc attaatttcc 
ataaaactat aacattattt taaatccatt 
gaaaaaaagb tgttttcatg gttcccttcc 
aaaacaatgt atcagttttg taaactaaaa 
aacatgccct tgtattttta tcatatactt 
taaagatacc attagtcttt gtgttgaaaa 
aaaatgaaat aaacaaggag caaaattttc 
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aattagtgta aactcaaaga acttaggagc 119400 

agttttatta tcatttatgt catttttcaa 119460 
agaaagaacc ccagttatat ttacaaaatc 119520 
tacatgcttc aaagcttcaa tttctcgatg 119580 
gagtttatat tgatctagaa tgttcagcag 119640 
cttaatttgc taattataag atagatataa 119700 
aaggtagtga atcagataga cttactagca 119760 
tgccaaggag ctttagacca agaacatcat 119820 
aacttttgag gtctttcatg ctcaatatcc 119880 
taagaccaat caatccttca taaaaggctt 119940 
tcttggctag tttcttattg tagcaaatgc 120000 
caaatatgct acgaagaccc ttgtcttcca 120060 
tgtggaggta tggtgatact ttaaaccctc 120120 
gtcttttcca tggaagcttc ccatgttgtt 12 0180 
caggaactgc tacagatagg ccacctttgg 120240 
cctgaaaaaa atggataaat catgtttaat 12 0300 
aaaaaattaa gtactacata ctttttatgg 120360 
tgattctaaa catccaaaag taacaggatt 120420 
atgtttctaa tgcaatacaa actgtctttc 120480 
cgagataaaa agattgattt aaggggttca 12 0540 
aagatcgtag gatcaaactc ctaaccacat 120600 
gttggttaaa cattttcttt ttaatacaac 120660 
tgttttattt tatgttttta cttttaattt 120720 
ttctgttttc aaatgtttgt acagaaaaaa 120780 
tfcaaaaatct ctgaaacaag aacccaattt 120840 
gtgaaagcag aaaatgaaaa cacagaaata 120900 
tttattttat ttctttaatc attgagtgtc 120960 
tatcaatgca aggatatgtg atgatgaaag 121020 
agttattttt ggttttctct ctttttcact 121080 
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tgagtctcga tcatacaaca aatttatcaa ttgaaactac agtaagcaga ggaataatcc 121140 

ataattgaaa aggaattaat gattcagatg gttaaagagt taaaagggta ccttggaggc 121200 
aagagcagga gcagtttctc tcatatcaaa agcctttgcc accccattat tcaacctgag 121260 
aagcaaaaag gctccaccac caaggccact tgaagcagga ctcacaaccc ccaaacaaag 121320 
ggaagaagcc actgctgcat ccactgcatg ccctccttct ctaagaacat ccttcccaat 121380 
ccttgagcat ctacggtcat cagtggcaac agctccattg tgggcataga tcacctcacc 121440 
tttgtgtttt gggtgtccat ttgtttcctc aaggcttgaa gcaaaggcca aatttgaaac 121500 
caaaagcaca gcaatgacat gccacaataa cacagataca acaacacttg acatgtatgg 121560 
ggacagaatt tggtgtaggg aatgaaacat ttgctgctga cctgttgact tttgaagccg 121620 
agacatggat ggaacttgct tcagttcagt ttgctgtaat aacataaagc aaaagaatgg 121680 
atggcacttg gcagtgtcaa aagtggaaag gactaaggca acctcggttc ttatgttact 121740 
tttgcttttt aagcactgat gtctataaga tcctgctatt aataggggta tttgtacctt 121800 
aagtggaagg gtttgcctgt gttctgatcc ggtttaaaat tttaataagg tcaattttga 121860 
ctcgacagat atagaataga taagattagg tgaatctgat ttattgatat aatgtatatt 121920 
aatgttacat ataattaaat ggtagtatta aaaaagtata ctaattatgt tatgttttat 121980 
aacatatttt aatataaata atattaacat aaatatcata atattatgtt agaaaaaata 122040 
ttactttttc tctcccaaaa ttaatgttgt attttaaaaa aattatccca aaataaatgt 122100 
catgttaatt tattcaatgt agaactaatt attttttcta gtaatacctc tattaaattt 122160 
ctttttaatt tttcaatgta atactaatat tatttccttt catttattta attggctaat 122220 
tttgttaaat tactattttt ttattttttt tattttttta ttagttgtta tctaaaacaa 122280 
cactttgaga taaaaaaaaa agtatttatt aagttaagag tatgtattat taatagaaat 122340 
tatttaagaa atatactttt taaaataaaa atgtatagga ctatgcaaaa tataatatga 122400 
attcaagcct ggtcaaatca attgaacggg ttacaccagg aacactctca cttattaagc 122460 
atcccttaaa aagtaagtgt ttatttaaga catgtttaag ttacttaata aattggcctc 122520 
caacttacat ttccaaagat ctccttttgt aacttatacg ttgaacttgc attggaatgt 122580 
tcgttaccac actttcaatg atcatgcgct tatacgttga taatgcatcg gaatgaacgc 122 640 
taacaaactt ctaacgatga aacaaacatg cacttaagca tcttgttctt aagtttttta 122700 
acccaattaa tatttaaaaa ctaaaaaaat taatgttaca taaattttaa gaatgtaggg 122760 
ttatttatga gattcgttca gaaatattgt attagaaaaa attgtttaaa tcatataaga 122 820 



71 



WO 01/51627 

taaaaattag tgatgcataa tgataccatc 

acaaagatac tccaatcatg tgatatgatt 
tgtaagtaat ctctcacgta cctataaagg 
tgtctattta aaatttgtat taataattaa 
atggaacgtc tgaataaata gcgtcaaaag 
tgaagagtta aggaccactc caccaatcat 
actcagcctc aaccattcaa atgatgacag 
ttttacatgg cacccaaggg ttgttcaaaa 
ttaatctaaa aaaattgcaa ttggcttaaa 
ttttttaccg tttagttcgc ttttcaaatt 
gcgtattata attatattaa tttttattgt 
gtacttttta gttgctttat aatcaaaata 
ttaccaatat tatgaggtag aagacagcca 
aacttaacta aaaatcattt ttatcaaata 
aatatttgat tgtgaataac tcatctttca 
taattgctag agcatttgag tggctgcttt 
tcaacacgca actgcagtca acgtgtctgt 
taaacctgat ttttagaact cgtttaggtc 
agattgtatg ttgttaagtc tttactcttc 
tatttaaaag acaaattata accctctcaa 
cattttgtta caagcaaact attaatcttt 
gcacacagca tctaggaaaa aagtcaaaaa 
ttgtttattt ttctttcttt ctttcttttt 
gtaatggcag aaatcagtgt tcagtgattg 
agacaatcct tttgctcacg ttgtgagttg 
acaaaattta caaactacaa tgttctcaga 
actttaatta tagtagtgtt caaaaatgag 
tattttaaat tgaaattaaa tttaattaaa 
atagagagaa tttcgatgtc agtattgtta 
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cacttaaaat atgcaaagta aaatgctaaa 122880 

ctttgtaatt taaaaaatat tctttgtaat 122940 
ttttagaaac tacaagttta cgattatttt 123 000 
taatcaagtt tgagtgagaa aaaaataaat 123 060 
acaaagtgct gaagtagcac gagggaattc 12 3120 
gttaatggtt gagagatgct ataggaccga 123180 
ttctcaatga atttcttatt taattaaaca 123240 
attaaacaaa atcatttata aatcgtcaaa 123300 
ttgaaaatta caaaaattga aaattgcata 123360 
ttgttaaaaa aaaatcaaac caaaccacca 123420 
gataattaat aataagtata aaataattat 123480 
tatattttac tttcaaatgt taaaaggaat 123540 
gttggacata agcaataaaa agataaataa 123 600 
tccaattcat catatattat tcaattatat 123 660 
aattgcaaac caagtcatac gcggacaaat 123720 
aagtcaaatt atggcaaaca tactggaagg 123780 
tttaattttt tcttcttcta aaaaatacat 123 840 
aaatggcttt gggttcgtca agttatgatc 123900 
ttaaatagac tttttcaggc ataaaacagt 123960 
gtctcaaact tttgacaatt tttttttttt 124020 
tgaaaataaa agttgacctg ggaagtggtt 124080 
gcaacgctgg attagaacaa aatataatta 124140 
cattttttaa tgacatcttt ttccattgtt 124200 
atgatggtcc cgatctgatc aaagattcaa 124260 
ttttataaat tggacaatca aaattgcatg 124320 
tgtcacgtga aatagatgat gcacagtact 1243 80 
aaattttaag attattcaaa aatcttgaaa 124440 
tctctttgat tttttaaata ttctgtttag 124500 
tccaaagact atttgacaac gaaatcgaca 124560 
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aaagaaagag agaaaggaga gagatgggaa gagcatagag agaaaataga gataggacta 124620 
aggagaaaaa ataagtaaga agaatttcaa attcttgctt tttgactttt tagtgtaaaa 124680 
attgaaatat tttattttaa atagttaaag catttcaaga aattaatata atttcaaata 124740 
ctttttaaat acattattca aataacttat tagttttttt tctcggagac ttgagaattt 124800 
tagtgtactt ttaaatctga ttttttttta aatagcggaa atcaggttta aaagagtcta 124860 
cggataatta gtatgaatac aaagattggg cctgagctcg aattaagccc agattacctg 124920 
gggaaaacaa cagataaatt tacatgaata attttttttt tctaatttgt ctgatttctt 124980 
tctgtaaaaa tattaaatgc gttttatatt tttaagacat taaacgtatt tatttttatt 125040 
taacattttt ataaaatagt agaaagtttt attaagagaa tatatcagta ctcaattcac 125100 
ataatgtcat tgtccaaatt ccaaatatct ggacgcatta aagtgaagtt acctgttgat 125160 
tgatcatgat ctacttatct tggatgtact ataaaagccc aatatggata cacatagaag 125220 
ctagcccaaa ctgttcaatc ctatcactga cccgtttaaa acaaaatgga tgagttgacc 125280 
agttttggaa aagtgggtgg ttaaaaatct caattggtct cattcaaatg tgggttgatt 125340 
tgaaaaaaaa caaaataatt atgaaaatgt gctaaaaaaa ccaaatttta ttataattga 125400 
gtttgtacgt taatcactca tattttcatt ttataatctt aaaaaaaaac tttgaaacat 125460 
ggtgtatgtc aatatgactt atttaaacta tagttttttt tttactattt aaactatggt 125520 
ttgttaaaaa taaaacatag ttaaataagt tatagtaaag gtactcttgt tcttcaaaaa 125580 
aaaaaaaaaa agaggtaaag atactcttgt aactaaattt tcttatttct ccgtatcact 125640 
agaccatgag cattcaatga atacataaca atgatttgtt tgttggttgc ctaaaactca 125700 
gacttgtacg ataaaaggat gactacaagt caacaacaaa tatagaagaa ataaaatcaa 1257 60 
aaaaaaattg gccaaattat gttacatatg attttacttt aatttaactc cataaaaaaa 125820 
gttctataca tgagaagtac aaattgaata aaatggtatg atttttttaa ttttcatttg 125880 
tttcaattaa tgatttaatt gattttcagt ttatttaatt atttaatggt ttaatacatt 125940 
ttttttatta cagcagaaag acaaaatcta tctactcctc atatgcatta ccccaagcgc 126000 
atcagcaagt acccgaggaa gaagacccgc tggacatgat gacagaagga tttaatacga 126060 
tttgaataca ttttttagaa acaactctat caaaattatt gacggtacct aaattgattg 126120 
tatatttcaa gcttaattat gtttttaata tcaactaggg gttggttttt tagtctttca 126180 
aataaaaata tatatatttt tagtctttta aattaaaaca aaaattgttt ttagctccct 12 6240 
tccaccatct ttttccaaaa caaacataga gtagttatag ttgttctatc ttgtgtcttt 126300 
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tagcaaatca aagagattct aaaaagagag tatttcttta attcgagaga ttaaaaatta 12 63 60 
caaattgtca tttgaaagac tactaattaa aagttcaaaa aataattaag tttatatttt 126420 
attactattg ctaaactttt ttccaaaaca aaaatgctaa aatgaggcta agtacacgtt 126480 
ggtggtgaca caggaaactt cccttcatat gccttgtcag tatattaatt tacgaataac 12 6540 
ttgtgctcca tgtgattcga ttttatttaa gtgtaatttt ttttcttgag gcgaagataa 12 6600 
tattataata taaaaaggat ccaaaaccac aatcaaggag tgatgatttc ggaccaaaat 12 6660 
tgtttgttta attaaagcat acttctgtct aacttataga gttgacaagt aaattaatac 126720 
tatcactaaa tctccacttt actatttttt aaaaataaaa tttaaattta tgatgatatt 126780 
aattaaatta aaaactaaat aattataatt ttgatcttct tgtaattctt aatatgtaat 12 6840 
tttggtattc ttattttaaa aaattataat tttaattcaa tattcaactt tccgtatatt 12 6900 
tattttttat tttaaatttt cacaatttca atctacttat ttaatttatt tgagttatca 126960 
tattataaat aaaaaatatt cattattttt ctttatatac aaaacttaca agattttatt 127020 
taagaaattt gaatccaatt ttactcctac tatcaacttg tttttgtata attatttact 127080 
ttaaagaata taaaagtaca acctatgtta tattgttagt taacaataat tatcgaggat 127140 
cacgatgatt ttgtaaagaa ttatatctat gcaatttttt ctttatatac aaagctt 127197 

<210> 2 

<211> 335913 

<212> DNA 

<213> Glycine max 

<220> 

<221> CDS 
<222> 

(45163) . . (45314) , (45450) . . (45509) , (46941) . . (48763) , (48975) . . (49573) 
<223> Seq ID: 240O17_region_G3 

<400> 2 

aagcttataa aaatatttca aattttatta ttgaatcaaa tatactatct atagtggtta 60 
acaaaaatat gaatatgcat tgatttttct agagagggtg ctggggtcca aactcgaagc 120 
gtatttttgt aagaggaaga aaaatgtcaa atatataata ttgaaatatt ttcactaaaa 180 
ttatcgcaca tcatttttac aattatttta ttgtattaca tcatttattg tttatttatt 240 
ttcatgtaca tatttcttaa cctttttgca tttataaatt ttttaaaaaa ttataaaagt 300 
taatcacatt gtttgaatta actcaatttt tatttatgaa atagttactt aattttagta 3 60 
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attttgatta gtgattattt gaatcatcat tcaataaatg aatacatatt taagtgtaat 42 0 

aattttcatt agttattttt atgtatgata tttatggtat ttaaatttaa aatattattt 480 

attttaccta ttttttttat aagccaatat gttttattaa tatttattta ccttttaagt 540 

agcacaacat ttttataaag caaatattta gatgaatcaa gaataaaaaa taaaattaat 600 

ttttatacat tattaatata aaaatattac aatatcaatt aataaaaagt ataataaata 660 

taacccttaa agtgattatt acggaaacaa caaattatca cttttatata tctattttat 72 0 

ttctataatt atattcataa gttactacat attattttat cctccttaca actactttca 7 80 

caaatcatcc actaatttta tttgaaaaaa caaaacatgg aaaaaagtta gacacaaaag 840 

gcttgtttcc ctagtaattt atataagaga ataatagtgt aattaacctg tgtgactgca 900 

tacacctttt acttattaaa attggttggt atgcaatctc tgtctatgcc atttcaattt 960 

cgatctaacc cttgtgttgt atcaattcct cttaccacag ttagcataag taacatactc 1020 

acctttattc caagctctaa ccatttcgac ttgttgctaa ctagattgag gatagtgatc 1080 

tttgtcttcg ccaacaacat cgtccacttg aacaaaaaat caaaaagacc ctgaaatatc 1140 

tgaactttta aaagccgcac tttttatttg ttgtgtgtga gattttgatt tgatgggaaa 12 00 

atatcttgaa tctgcggcga ggctggcgga gctatcgcgg attgtgtcat ctgctgcgaa 1260 

gcccaacagg ccaaagagag cgctacccag atcgcccaac cgtgtggcga ctcctcgaag 132 0 

tgcgaaccca tttggtgtta aggtcgaacc tgcgaaaaag atggaacctt tggaagaaga 1380 

acaacaatgt cggacacccc tcgccaaggt tgtctcagat tgctccaagc gctggttcca 1440 

ggacaccctc aaggaggcta aggctgggga cactaccatg caggttttgg ttggtcaaat 1500 

gtattactct ggatatggtg ttgcgaggga tccccaaaag gtcttttctt tcttcttatt 1560 

tttatcattc tatgtgcaat gtaataattg attgtacacc ccttgtaaag ggtatggtca 1620 

tagactcata tatattagtg cttgattctt ttaagttatc cctcatttga tgccaaaatt 1680 

atgtgatttt gttccctcct tggaattgga actgagatag agtgcaggga agttgggcat 1740 

tttacttgct gttagatttt aaatcccttt tcttgtgtta taatttttat cagattgttg 1800 

ttttttgatt tagaagctct ctcatattag cttctccaaa agctggtttt taagtcaatt 1860 

ttagcttatc agattgttgt ttttcctttt attttcttct cctataagtg ttcatggaga 1920 

agtttatcca aatatgcctt taaattgctt tggtggaagg ggctgttagg gcttctaaat 1980 

cttgctcaat tgtcaatcaa gacccctcac tgcagttaat ttgaattgtg taattggtat 2040 

gctattagct gcttctcaac ttgaggttac tgactatact gtgacaaact ttgtttatag 2100 
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gggcatgctt ggattagtaa agcatcaagg aaccgaaatt cagtttggaa attgtgcggt 2160 

aaacggccag gtaagctcta ccttttgttg gactagtgaa ttgtggtttt atgtgtcctg 222 0 

attgatctct attttgtgac attcaggtta cagagcaagt gattctgatt catgtgaact 22 80 

ggaggaaaaa gataaatatt ttccttaatt tgcaagtata actctttatt tttttttctt 2340 

ctaaaaagat tgaatttcta tttcaattct atggtatagt atatgtgaac ttattgtcac 2400 

tgatactggt ggttatttat tgtaacatcc aaaacataag tcaaattttg ttaactgaca 2460 

gttgctctac aatggtgtct ctatttttga attagatgtt gtgttcattt tcatgaagat 2520 

gttataatat actagtaata acttaaaaca acaagtgtat agcttatttc ttatgcctat 2580 

gtaagcattg tgttctgtcc ttttttcccc ctccaaattg atatgctcct agtattgacc 2640 

actccgcatc taggacttaa agagaaactc ttgcgcttag actttcatcc atactcctca 2700 

attccatggg agatcatagc tgtaaatgtt atcttctgat ggtctgatac caaaaagcat 2760 

ctagtttact tttgccttgc tgcaattagt atgataagct gcctgaaact aattgtgttt 2 820 

gattcagttc tcattttgaa gaaataatgc actgactagc agtctagcag tattgttatc 2880 

taggattcta ggtgtattca gtatgatatg ccgctatagt atgaagactt aaaaggtcag 2940 

gctgctgggc actgacatct attttgggta tctcatgtca aaatcaatta atacttagga 3 000 

tgcatttgga agaacttact tatttacaac ttatttggaa ttacctcatg atataagctc 3 060 

ttgtgtaaat gtttctgaaa gcttataaaa ttagtttatg atcagttcat aagttatttt 3120 

tagctcatta atttcagtaa gctctggafca gcttatgaaa atagcttaca acttatatta 3180 

aaacagttta acattatttt ctcttcactt atagaaataa cttgtacata agtgcatgta 3240 

tgataaacac tcgattagta agtgcttaat taagattttt aaccaaacaa gcccttatgt 3300 

agtgtttttg ggtgttgttg ctgcaatcag tataataagc tggcaagtac aattgatatt 3360 

tgtttgtatt cagtgttata tgtcttgtgc ctttgagtca attaatagga tcacactgtg 3420 

tatttagtta acctgataag taaagactga ttactgaatt aattgcattt gcagcattgc 3480 

gagcccaaac tgatatttgc ctcatgaaat atgaaagatg ttgattacta atattccact 3 540 

ataacatata attgcttgtg ataacaaatg gataatcact ttttggtttt ggttttacaa 3600 

atagaaagct tgcatgctat catgtttgtt gacgatcagg acctactaat gggaatttag 3 660 

agaagactag ggtgtggtgg ccttttacgt tgtgctatat tgtcttttct catgctatta 3720 

cttctcaaca ctgggtagaa ttcctccaat ttgctttttt atctcttata aagtaaatgg 3780 

cgattaggaa gcaattaata tcacatgata ccggagaaga atccatttta ataattgacc 3840 
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ttggtaattt ttgtgaatgg gagtcgctaa ctcaaatatc agataaatga tgaaaacata 3900 

acatcatgaa aatagtcagt gttacaaatg tttatgttaa cggttcattt atagatccaa 3960 

aaaggaaact aatataattc aaatgattga taaactaaag gacaaatttt acatatggtt 4020 

ttttaggtat tgtttttcac tcagaattaa agtttcaatt tgttaatatg cataatcatt 4080 

catggcaggg gagaagtttt tttcctatgt ttagaaatca tgaatcctaa gaatcatttt 4140 

tttttggaaa cgagtaaaat ctgtttggtt gaccatagat aaattttcta gaaattaaag 4200 

agttatataa tatttttatt aattatttat cacatcaata aaagtaatat gatattttta 4260 

ttataataaa agaaaaatta aacttgaaaa agtaagttta ctacaatgag aaaaattttc 4320 

ttgttcgtga aaaaacttgt taaaaaacat ttcctaaaaa tattgttctc taggaatctt 43 80 

atttttaacg gatatcaaac atgggaaact aacattttta tctcaaaagt ttcataaaac 4440 

tcaatttttc cccaccaaac acctcctcaa ttgtagacct gtatttcaaa tattaactat 4500 

atgaaacttc aattctaatt gaaaaataac aaaacaatac ttatgaatta tatgtaagtg 4560 

ttgtcctaaa ctaaactaaa ccaatcaaca aagaaaataa ctcaaattct aattgaaaca 462 0 

aatgaaaaga tcatttcact gttctagata aattttatga tatataatgt aaaagtgaat 4680 

taatacctaa attaagatag tacacaggca aacttcaaga aaagagaatt aatgaaagat 4740 

ttctacaatt acaaattatt aattcattga atagaagaag tcccaaagcc ttttgttttc 4800 

cagtgaaaga accaacaaaa aattggttga atagagagca ttgcgcgctt tagttgattt 4860 

gatgataatt gaagataaag atttatttat attgtttcgc tatatattcc tatgttgttt 492 0 

tgaggcgaag tatgatgctt gccgacctca cttgctaatt agtggaatcc agacaactcc 4980 

caaaccttaa aataggatta aagtttcacc ttttgtttga aatatttctt acagaaagta 5040 

aaagcttcca caattcacgt taagaatcca attctgtctt gtatttgctc catttcacgc 5100 

ggtttcgtaa atcattagaa ccccacttct ctcttttttt tttctttttt tctttctctt 5160 

ccacttctca tctctgtcca aaccatagag ttttactcac acctcttcag ctttcaatca 5220 

tggcaaatgt aaggcttttt attaatttct tgctttctat ttgcaactat gagactgaga 5280 

aggcttttct tgttgcatgc catcatcagg ttgtggaagt gaaagtcggt ttgcactgtg 5340 

atgattgtat caagaaaatc ctcaaggcca tcaagaaaat tgaaggttct taattaattt 5400 

tctcaacacc aataacacat aatcactttt tgtgttaatt agtttaaaat ttgtgttctt 5460 

tgcagatatt gaaacttata acgtggacac aaagctgaac aaggtcattg ttacgggaaa 5520 

cgtgacaaca gatcaagtga tcaaagtcct tcaaaaaatt ggcaagaatg caactgccgg 5580 
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ggaagatact caaaccaata agtgaacttg tttaggatag gacccttttt ttccctcgaa 5640 

acatatcctt ctaaactatg tactctgaaa ttcaataatc tatatattgt gaatagttta 5700 

attttattct tgaatttcaa ttgcaacact tgatatatat gttttttctt ctttatttaa 5760 

ttttaaaaga gagattaggg aaccattgga tgattgattg attatgcagt tctttagctg 5820 

ttctgaagtt tcatttcact aacctatgta tccttacatt taatgtaact tttattatgt 5880 

aattttgttg acagatttta taatgtaaat tactgacacg aaatttctat ttggattaaa 5940 

aacaacacta aactggatat aagtcttgtc ttttatttgg gaaatttaat ttatatatac 6000 

cgatattata aagttttttc acactgatat ttacttgtaa attatttttt ataataaaat 6060 

tgttactctt taaaagttat acctaccaac taatttttaa ttaattgata atataaaaaa 6120 

ctaacagcac ataaaaacta aaatctattt aatgttataa tcgatttatt atttatacaa 6180 

gtgcggataa agttgaaaga agcaaaaaag tgaacactaa caacaccaag taaaatatta 6240 

attgttggct acctctttct ctcacgtata atagcgtgaa gtgttactac tgaaccgtgg 63 00 

tgcctgatag caaaaaaatt atttttttgt gacggtctta tccttgagtc gtccctgtaa 6360 

tctagctaaa gtaaatgttg cgtgcgttcg ctattatata tatataattg acaatattaa 642 0 

ccattaacca caatgcattt gtatggcaat agcttttcga tttacgtagt tctgtggatt 6480 

tgctaaaagg tctcaagtta atatgatttg tatcctccac acccaagttg tataaatata 6540 

gatataaata tttcctggct ttgctacaag tctatagcca ccttacttga aggcaaaccg 6600 

ttaaatatgt caaatgcaaa atacaaggtc aacctcttat tctcttgtgt atacttcttc 6660 

atgggtttag agcactgtta tccttcctct tgtttttctc aaggaaatta aagtactttt 6720 

taacaataag cacaagaaca actaaggttt tttttttcct ccttgatatt gattcacaaa 67 80 

tatggccgct gtatctgaaa ttggaaacaa gacattggta tgccaaaatc cccacagttt 6840 

tggtcacttt gatgtctggc accgaggcaa cccattagaa tcccccactt gtcttctctt 6900 

tctgcaggtc tctatgatga ccatagtcac acagataatg gacgcatgtc tcaagccact 6960 

aggtcaatcc tcccttgtat cccagattct tgtaagcttt ctctacattt tacaaggtgt 702 0 

ttaatgatgg tcactagaat cagtgctcta tgaccatgta attctaaata tctacagaaa 7080 

tattagtaac actcctttga atacattttt tagtatagat taaaatttgt taaatactat 7140 

aaagtttggt ggttatgcat tttacaagaa agacacttgt agattgacag gactatgata 7200 

gctgtaataa tttctcgtga tagtgtgtgt tgctaccact cctatcacca ataatgtaaa 72 60 

aatgtgattt gtgtatcaat aatgtaaaaa tagcgtgttg ttaacaaatg atgcatggta 7320 
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acagggtgga gtgctgtttg gcccctcaat gttggggaac aaaaacattc tggggcaaac 7380 

cttatttcct gtgaagggtg ctgtagtgct tgaaacagtt gcatcgtttg gccttatgtt 7440 

cttcttcttc atatggtgtg ttaaaatgga tgttgccaca ttgatgaaga ctgaaaagct 7500 

ggccatcact gttggtattt ctgtgttcgc atttacattg gtaatcccta ctggactagc 7560 

aattctgttg aggaaatatg ctacaatgga cagcagcctt gcacaggcac tgccctttat 7 620 

ggctctctca caaaccttaa ctgtattcat tagcatagca gtgctcttga aagatctcaa 7680 

agtccttaac actgatatgg gacgtttaac catgtcagca gcaatgtttg ctgatatagc 7740 

tggctttacc ttgacagtga tcatttttgc tgtactgcag aatcagagtg gtagctttct 7 800 

gacactagca ggcctccttc tgtccgtagt tgcactcttt cttgctgtta tatttgtgat 7860 

gaggccagca atactttgga cggtaaagta ctcaggtggc ggttcggtta atgaaagctg 7920 

cgttgtttgc atctttctct tagttctctt atctgcattt atcagtgagt taattggaca 7980 

gcattttatt atgggaccaa taattttagg cctggctgtt ccagaagggc cacccatagg 8040 

aacagctttg ttgagtaaat tggagacaat ctgtatggga tttctttatc caatctacct 8100 

cgctgttaat ggattgcaaa ctgatatctt caaaattgac ttgcaatctc tgtggattgt 8160 

gggtctcata ctgatggtgg ctttcgttgt aaagatttgt gctgtcatgt tgccaggata 8220 

cttttacaat ctacccatga aacaatgttg tgtcattggc ctccttctaa atggaagggg 8280 

tatagccgag cttaccatgt acaatatgtg gattggaagc aaggtatgta tctatgtatg 8340 

tgcatatgag attgagacta gtgaatttaa tttcaaattg atgccaaaaa tgtgtttcag 8400 

gtcgtcaaaa ttagtcttaa ggcactggga ttcactgatt ataaaaataa gaataccgaa 8460 

taccgaaatt atcattttta aagtatatgg atgaaacaac aagtcgttgg ttagtgtagt 8520 

attggacttc attcccttaa acaggatctt agattcgagt cttgtagatg aaaaaaatat 8580 

aattgggagg gacaacccta ctagaggtgg tcagccagat tcttcaacag agattgatta 8640 

tcagcaaagt taatgatatt ccgtaccaaa taatatggtt aacaaaaaag tatatggatt 8700 

aagactaagg ttaattgagg gacataataa aacacatttt acccggttga ttaactaaca 8760 

gtctaatact tgttctcttg tctatggtag ttaatatcgg aacaagagtt cgctttgatg 8820 

gtggcctcca ttgtagtagt aaacgctatc ctagcaccaa tcgtaaaata cacatatgat 8880 

ccttcagaac aatatcagac cggaagaaga tgcacaattc agcatactgg gcgagatatg 8940 

gagctccgag tcatggtgtg cattcacaac aatgaaaacc tcccaacaat cctgaacctc 9000 

ttagaagcat cctatgcaag cagagagagc aagattgggg tcacagcatt agtcctagtg 9060 
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gagcttcaag gaagagccag gcctattctt gttgataacc aaaaccaact ccatgatgag 912 0 

ctgcgctcaa tgtcttgcaa cgcaagtcac attgagaatg cattgaggca atatggacaa 9180 

cagaacgaag gatatgtatc tgttcaatct ttcacttcaa tctccacctt tgaaaccatg 9240 

tatgatgata tttgtagaat ttcattagag agtggatcca acattttgat cttgccattc 93 00 

cacaagaggt gggaaattga tggcaccgtt gagatttctc ataggaccat ccaaaccatg 93 60 

aacattaatg tcctccaaag ggccccatgt tcggtgggaa ttctagttga taggagcatc 9420 

ttgaatcctt ctccttcact cttgatggct agagcagcat tctatgtcgt ggtgttcttc 9480 

attggcggtc aagatgacat ggagacatta gcctatgcca ctagaatggc taggcatgaa 9540 

tgtgtgtatg taaccgtggt aaggttcctt ctatttggag aggagaattc taaagataga 9600 

aaacgtgaca gcgatcttat agatgagtat agatactata atgccagaaa tcgtaggttt 9660 

gagattctgg aagaattggt gaaagatggg atagagatgt caacatgtat aagaagattg 9720 

atagattatt ttgatctagt gatggtggga agggagcatc cagagagtgt tatttttcag 9780 

ggacatgatg aatggagtga gtgtcaagag ctagggatca ttggagacat gctagcatca 9840 

ccagattttg tgaccaaggc atcgttgttg gtggtgcaac aacagagaat aagagggagg 9900 

cttgttacac ataatgtgaa tgccactcca gtgcctaatc aaagagatca gctcctacat 9960 

gatgttccaa ttcatgaaac ctttagtcct tcatgtacta tttcagtgga caaatatgac 1002 0 

aaaatgtaga tagattcctg tgtttagtga attttccttg tcgatatata tatcatgcaa 10080 

atcacagatt ttcatttaaa accacaatat aatgttactt tcgacaaagc catgaaaaaa 10140 
taaaattgat catcacatat ggttagtact tgatactaat acattgccct cacttaataa ■ 10200 

ttttgaatta aaaaaaatac ttgaatagtg gaatggtact aacttcctta aaggtgtgaa 10260 

gaggcataaa tggaaaaatg tgattttatt taatataaat ggctaaatga gtaaaccatg 10320 

tttggttctg cacaatagtt atgaacgttc aaagtctgtc ctacccaact ttgcatgata 10380 

ttcaacctta cgggcatgta atgaggaaat ccccttaatt ttactaaaat aaaaatcaaa 10440 

acaaggataa aaataaaata cagataaaaa atccttaagc tacaagtcta atcagaaaga 10500 

aaaaaatatg atccagaatc atcaaatgtt tacaattcca attaatctct ttgtaacacg 10560 

taatgttaat tcttttcgta aattaaaaaa attcaactac atgttgtgta aatttacaaa 1062 0 

aattatatac acataacaat ctcaaatcaa aaaataattt ctgaatgctc atccacagaa 10680 

aggtttaaac tgcacgtctt cacaaggtgg attccttaat acagtcagtg tatatatata 10740 

aacctaaaag taatttatta gaaggtgcta ataagttggt gaagcaagcc aatctttcca 10800 
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taaagcaatg gtcatggaag gtgatacttg gaaagaaagt acctgaattt cttggtaaag 10860 

caagaaatta tgtaaacaaa gacattggcg tcaggaaaca gcatctccat tttaagtgcc 10920 

aatttatggt gcgtattcaa acatcatgtc aaataaaaga aagaaattct tacgactttg 10980 

taccatttca tgctgtaatt gaggggtaca catttttttt ccaacaactt gaggggtaca 11040 

cgttgaacag aacctaaacc gttctcgtcg aataataccg attcgacaaa taaaaaatga 11100 

ataaattata ttggcaaaaa aaaaaataga ataaattata ctttattttc caactatttc 11160 

ttactttttt agttttctct ctctctctat aagttatata tttatataca aaaagacgaa 11220 

attcgtaagg caatcttatt ggtattataa ttttctctac tgattatgtc taaccattta 11280 

tacacacaca cacacacaca tatatatata tatatatata tatatatatt attacttgtt 11340 

aaataaaatc agaaaaatgt tgtaatcact ttcaaaactg tagttaataa accttaacta 11400 

aatcaagcaa aaacaatgga taagatggaa gtttagtgat acaaaaatat atacaggtat 11460 

agtgagaata aaaaagttga ggaagtgtga aatctacgtg aagatgaagg atgaaaattg 1152 0 

gttaagttag gttaggttag attttataaa atatgtatga gtttgatttg ttttttaagg 11580 

ttcgagctta gtttatatat ttagttgata tagactacat ttaaaagact gacttaaaag 11640 

tctctttaaa atacataata ataacgaaaa tggattaagt tagatttgtt tttttttctt 11700 

tttactgggt agttagatta ggttaatctt tataaaacat gaatttgatt tgtttttttt 11760 

ttcaaaaaaa attaaggttc aagcttaatt tatttagttg atatagacca ctttcaaaaa 11820 

tctgacttac aagtctcttt agaattcata atagtgacac ttgattaagt tagattagac 11880 

tttataaaac acgagtttga tttttttttt taataataat taaggttcta gcttatatat 11940 

attatatagt tgatatagac tactttcaaa agtctgactt aaaagtctct ttagtataca 12000 

taataatata accttttaat ttagtttaaa aatttgtccc taaataaatt aataaatcca 12060 

aacttatata caagttaata ggtttaagtc ttaatatata tatatatata tatatatata 12120 

tatatatata tatatatata tatatatata tatatatata tatatatata tatatataga 12180 

gagagagaga gagagagaga gggtcatttt atacgagtga gaaaatttaa atattattat 12240 

gaattgtcaa aattaaaata cacacatgcc acatgatttt cttaaaaaaa ttacgtaact 12300 

tttttttaca aaaataatca tatggtttta aaaactaatt taaataattt atatataact 12360 

atatcagtta aatttgtttc ataaaataag tatatcagtt attttacaaa attataagta 12420 

ttcataaaat aaatacaaaa tgataagtac caagtgtatg gatcagctta tgcgatgttg 12480 

ttccaatgta actaataatc ttaaattcga gtattgaatc gaatatgcaa ttcatttaaa 12540 
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tactttaaga gataatttgt ttactcgtaa taattttatt tgacttaagt aaagttttct 12600 

catataaaaa atacatatag tctataaaaa aaacattttc tgactaaata tattttcaca 12660 

ggctccacac aaaaaggaaa taacaaagtt ttaagagaat gtatatttac actcatcagt 12720 

ttgttaaagt taaaattaaa atggagtaga aattgagaga aaagagagga aatatttaaa 12780 

ataaaggttg atttgtataa ataaaatgtg aaggaaagaa ataaaaaact ggtgagtatt 12840 

actcaaaatt atctttatac tgatgaaagg aaaactgata aagaaagtaa acactaaaaa 12900 

aagagaccaa cttacaacta acatttactt cgagcatagt taattaagtg tttgggacat 12960 

aatattgttt ctataactaa gtcttacgta gattaaggtt tacgagacct atcctaaaaa 13 02 0 

catgaattga aactagtatc acctgtcttc tgggctcaat cctggggatt cataaagaca 13 080 

tttcttgaac aatcaaaggg gttatatgaa tggtttaacg atcattaatg tgatattgat 13140 

tgacaaccga tcaatgctag atatataggc ttaaaatcct gtatcagtct acagacgact 13200 

aatatgatgt aaaattcctt agttttaagt gcttttgaac atcaagagac ttaaagttcc 13260 

gtgttggttg acaaacaaat ggtatgatat ataatccttc aacgtaaaca cgaaaaaaaa 13320 

acttataatc tcgtgccaat catcgataca gtacaaataa taaattaaaa tgcaattttt 13380 

ttcttgttct tattttttct tatttctctt aaactagata ctatcgaatc cattctattt 13440 

cttatctgtt tccattattc tacttctcac ttattttcat tactttattc ctttctttta 13500 

tgtttctatc cactttattt atcacctatt tctttctttc ttaccgaata ctaaacaagc 13560 

cttgtgatcc gaaagcccga aacaatcatt ttttatgaaa cagcttacac tctggtggtg 13 620 

tgttgtgtat agttaaataa gcttttaaaa atatggtaaa ttataaggtg agggaccaaa 13680 

atgtgagatt gaaaataacc gtttcatata ttattcaaat aaataaatgg ctaaaattga 13740 

atcattctcc gcatataatg accacccatt tattttatta atatatctaa caattatttt 13 800 

taactccata taggcatttt tgaccctcat cttaaaactc acctcaagaa atatatagtt 13 860 

attttaatta aattagtact caacttcaaa ttaattatta gacaagtgtt gttttttaac 13920 

catttatcaa attaggaact ttatgtcacg ttatcctaaa atcgttacat aaatttttaa 13980 

tgtcacgata caatttttta gaagaaaaat ttgtctgaaa cccatatgac atgggatgca 14040 

ttagtcaaag taacacttcc taaatcatca acttagttag tggcatgcaa catggcgtta 14100 

acctattttt ttttttctag aaaaaaaaaa acatataaat atcaccagct gatgtcgcgc 14160 

caccttcaac gcccagccca gtgtaggcgc actatgaaat caatgcagtc agttttgtca 14220 

tgtcagactt gcaaaaaaat ctccaaaatt atatcacact taataaattt tataaagtat 14280 
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tatgcataat atttttttaa agttattatg 
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attccagttc tgttcatgta aatatcgtag 
tctaaaacaa gaaacgtgtg aggaatctta 
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catagtatct taaaagtttc attaccacta 14340 

tgtgtaggct catatataac caaaaagtaa 14400 

tttattttat cttattttat tttaataatt 14460 

ataattttat tttccctttg agattaaaaa 14520 

taatatgaaa ttagttattt ttatttatat 14580 

agactaaatt -'aggtgggaat gatggtcaga 14640 

attcaaagtc ctaacttata actatattaa 14700 

atcatattca attctaaaag aattgtttct 14760 

aacattactt gggttgattt ttgtagcatc 14820 

attttgacgg aagaaagttc agaatgacga 14880 

ttgtcgtgtc tgagacgtgt acgtgtacct 14940 

gtttattcgg gacgcgctta accaacttca 15000 

gttacatggg cgtcagtcta tttttttttt 15060 

ccaattatat aattgtgaga ttaaatattt 15120 

tttttttaaa atatattaat tatatttgta 15180 

aaattatttt tagttattga tattaaaaat 15240 

aaaggttgag aattttttta aaaatattgt 153 00 

agaatagttt taaaaaaatt ctcattcaag 153 60 

atgatttcat gtaatacgaa aaattgttaa 15420 

actgccttgg taaaaaaatt tcataagata 15480 

tattgtgcca tcctcgttac cttctgttca 15540 

atcacgtaac gtgtgtgttt ggaagcgttg 15600 

atattctaag acacaagcaa caaaaggaag 15660 

atgcgtacgc aaaaacagtt aaatgattag 15720 

acatagggaa atgtcactaa tttgttatac 15780 

tcaaaaacat attaacttgt tctctaatta 15840 

aaatacttta gtgaaaaact catagcagct 15900 

aagataattg catttttttc cttttctttt 15960 

agaattaaga tgctaattta aaaagttgct 16020 
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gagttagagc ataaaagttc aaataaaaaa taaatgaata gacaaactat taaattatta 16080 

atagccttag ccttgaatct gatgcagacg tgtatggcaa tggacataga agcattaaat 16140 

aggcctcgtt acattcaagt ttcaaccaaa ttggcaggga aaatcctctg atactgttgt 16200 

tttcctgaaa ccatcacaat ttgtttctca atcatgtcaa cctcatcctc ttcccaaagc 162 60 

ctcaaaattg gcatagttgg attcggcaac tttggccagt ttctggccaa gacaatgata 1632 0 

aaacaaggcc acactctcac agcaacttct cgatctgatt actctgaact ttgtctccaa 163 80 

atgggcatcc attttttcag gtaagtcaaa ccaaaccaaa ccatgcataa atacatacac 16440 

acttgcacca ttttgctgga aatcccacgt ggatcagtga tatagtcaaa atagtgtata 16500 

taaatagaga acaatttttc acttacgagc tgattttgtg aagttaaagt ctaaaggcaa 16560 

attctaagac atttcatgtt ccgtatgtca aacattacgc agggatgtca gcgcattcct 16620 

taccgcagac atagatgtca tagtgttgtg cacatcgata ttatcgctat ccgaggttgt 16680 

cgggtcaatg ccactcacct ccctgaagcg accaacgctc tttgttgatg ttctttctgt 16740 

caaagagcac ccaagagagc ttctactgcg agagttgcca gaggattcgg acatactctg 16800 

cacgcaccca atgtttggtc ctcagactgc caagaatgga tggacagatc acactttcat 16860 

gtatgacaaa gttcggataa gagacgaagt tatctgctct aatttcatcc aaatttttgc 1692 0 

tactgaggta ggttaatatc ctttgtcaat acccatcaat cacgaaagaa gaaagaatca 16980 

tttttttttt tttttattgg gccagtttaa ttatgttaat caagaagaaa gaaacagaga 17040 

gggtggaagc taagtaactt cagacgtttg catttgataa atcaagatac aagataaatc 17100 

tatgttgtaa aaaatgtaaa gtctcagtcc cacatctaac agaggtaagg attaccatca 17160 

ccttaccctt ataagttatt tgttgatttg agttaggcct aaatttaaac ttacaaagta 17220 

tcaaaggtta tcttggatct attaataggt cacttatcat attacctaca cacaaaaccc 172 80 

aataatgtgg accgtgagag ggtgtattga gaaatatcaa cacgttgatt ttgtgaggtt 17340 

aagctaggcc aagtccaaat tgaaagaatc taagcataca atcctacagc caaggttata 17400 

gaacctcaac gtcattattg aaaacaaatg catattaaac aattcaatct tcaattctcc 17460 

cctggactga gtttgaacta attcttcata agtttggacc attgttgctt ggtttacaaa 17520 

agaatcacaa aaatcagttg tgaatgcctt acatttatga ttcatccggg gtacatctat 17580 

ttcactgtgt tttgttactt ttgtattacg cagggttgca agatggtaca gatgtcctgt 17640 

gaggaacatg acagagcagc tgctaagagc caatttatca ctcacacaat tggcaggtat 17700 

gcagcttcct acatatctaa taaaccattg agaagcacta atataaatgc tcactagatg 17760 
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taacttttgg ctgcttcttg atcaggacac tgggagaaat ggatattcaa tccacaccta 17820 

ttgacactaa gggcttcgag acacttgtta aattggtaaa gagttgttaa catttcccct 17880 

actttctcta aaaaaattcc tttatatgtg gttatgtatc atatggaaaa gttacttgca 17940 

gaaggagacg atgatgagaa atagttttga tttgtatagt ggattattcg tgtataacag 18000 

attcgccaga caagaggtaa tggaactgcc aacgaaatgc ttatttactt ttaaattcca 18060 

ttttaaagca ctgaaccaag taccccaaga tagactcatg aattgtgaaa aatatgcagc 18120 

tggaaaacct tgaacatgcc ttgcacaaag tcaaagaaac gctgatgata caaaggacga 18180 

atggggagca gggtcataaa agaactgaaa gttgatgcat atttatttta caagatattt 18240 

tctctaactc tcaaatatcc tcctgcagtt ccaattataa attactctta tttcagtttc 18300 

cttttaccaa aattgaagtt caattaataa accaaagaga ctggtatatg ttcaatcaca 18360 

tgcgataaaa atgttccacg ttcttgttcc gagcagattc tttgtaattt cataaagtta 1842 0 

gagaaaagaa aaaaaaaaca gacatfctagt cgccaatgcc taaaaccata taataactcc 18480 

acagtttggt tctctgaatg aattcccttc attttaatcc aaatctcaac tacctcttca 18540 

attctaaaca aataaattag aacactacca agtgatcctc tggggtcttg acatgagcct 18600 

tctagttcta gcttttcaat caatgtatct acagagcatg tattccattt tggattatag 18660 

agaaataatg taaaactttt aaccaaatgt tactgcaaat ctaaagaaag ttcattgctc 18720 

catgataaat tgataatact acataagatg tacaactgct gattttatat atcattttaa 18780 

caagacttgc caagagatat atcccttaaa gccaagagca cttatgtttc gatttgagac 18840 

acctctattt attccactta catttgaaaa ataaaaatat tacattcacc aaactgggaa 18900 

atgggaaata tcaaaacgta tagaagtgag ccgtggaagg aattgtaaac aagttatgac 18960 

aaaaaccaga acctattcct ttgtgcctat tttaccagct tttcaagaat agtacaaaat 19020 

taccaagaaa aaaaaatgat gcaacgtatt ttcaccacat ttatttttct ctctcatcaa 19080 

tgctggcttc ttctgtttcg ttgcataaaa ttgcagcagg attcggagga tctgatgctg 19140 

atggctctgt ctttctaact tgtaggccag taaagcgcgc taggtcaatc cattgctgca 19200 

gcaactcacg aaaacaaaaa attttaaaac tcctagtcaa atttagaaat gtatgtgcag 19260 

atgaataatt cacataaaac taaattatac tccattggtt ttaaaacaat tgctactctt 19320 

gattcttgag gttttagtct gtttcaaatt atgtcacttt agaacatgaa gacgacattg 193 80 

cttttttctt tgtcttttat tgtttctcta actaaaaatg aaagcaataa tggcggaaga 19440 

ggataatata gtcagacgaa cttagggttt ccttgaaatt aagaatattt aatgacttta 19500 
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ttaatcatag tagaaaacat taaacaatta ttgtgaaatg gaagggaagt atattgcaat 19560 

atggcaaaag caaatgctaa aacagtgtct ttacaatatt ggctaaaaag taaaaagtca 19620 

aaagaaaaaa gttattagaa aatgttacta ataatacatt tccattgtga ttttcaatac 19680 

atccctaatg taatttctaa taaaagaaat tctattatta attcctcaag accttattac 19740 

aaatgctaat gagtttcctt aagactcttg tcaagggatt aaaaaaataa aatatattaa 19800 

atgaaaagta atgtattaca tgttgtactt ttcataaaat gctctatatt tttggcttaa 19860 

ttatactttt gatcttctta ctttttcaat tttgtaaact ttatccctct attttttttc 19920 

cacaattttg atctctaatt attttaatta tccattaact taacatccaa tattcgataa 19980 

atgtgctgac atggcagtgt agaagagtgt catgtcaaca tgaacgtgtt gacatactag 20040 

caacacgtta acaagaccct tttttattcc ttaacaagca ccttaatggc actcgttagc 20100 

aacacctttg taatattaag atgacacatt ataagacaag cggaagttat tttctttcct 2 0160 

taacttgtga tttattgttc ctagacaaaa gcacaagctg acgcaattat aactcctcaa 20220 

aacacatatt tccatattaa cacttgaatg tgaaattcac cactttaaaa agaaggaaaa 20280 

attaaattag atttttgaga agaattatag tgttcaacca taaataaatg aaatccactt 20340 

actgacgaga ttgtgaacat tgagcattga aagtgaacaa aagcagatag aagataagga 20400 

aaaaaaactg acctgggttc cccaatatga attgactgtc cgtgctacaa aagaaagcat 20460 

atttacaaag aatgtttgag aagcaaactg ttcagaaatc cgatgctcca ctatcccagc 20520 

aattatctgc tcaacaaaaa tattataatt tccgtattaa tacaaaaaaa cagttccagg 20580 

cattaaatgg atgtatattt gttgaatatg atataaccaa agatgtagtc acagaacaat 20640 

caaaaatcaa ttttaagaaa gaaagagcct taaatagtta ttcagcaaag tgcagatgaa 20700 

gaagtagaaa ggaggagaaa attacaaaat cagatataat agatattatt gaaccgagta 20760 

attttttttc cacgattact tttaatagca tccagggttt actaacataa tatttggttg 20820 

gaaaataatg gagaggaaag gagggcaaag atttggaagg aaagggagaa tggaggggag 20880 

taaaacacct cctctatcaa ttttggctcc cttccaaaat tgggagaatt tggagaggag 20940 

aaagttttac atgaattgga ctaaactatc cttaatggtt ttatcctatt atgaggatat 21000 

aataaataat aaattatttt attttccctc ttatttcctt gtgaaccaaa ccaagtgttc 21060 

ctcccctcta ctccctttct ttgaacaaaa tagatagtat acgatgtagt ctaatctttc 21120 

ctattatgct tgctacccac catctacttc aaacactgcc taagtggccc gaaaaacata 21180 

gtgtacaaat tagttattca agacacagct ggtcaagtcc tgctttcaaa atgtctacat 21240 
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gaatttccaa attgtatttc tgtcctgctt 
atcacataat caactattga gtattgaaac 
ttcatttcct acagaaacta ggttcaatct 
aaatgaaaac tttatgcaac aaacaagttc 
tattccaaga aagcatgcat atataaatgc 
ttgttcagtg acaactgctg gattaaggac 
agccacagct cctatgcttg aaataaatcc 
aactgatgca atcctatgat tgagattcca 
attgtccggg atggagccaa gaagtcccat 
ctcatcagca tatgaaagga atgacaaagt 
aacaactgat cctctgacag tgtccgtaat 
tttccgcttt tcatactggg ccaataatgt 
taatctgtgc aagaatttgg gatctgccaa 
aaccagatag tgaagggaac aaggtgatga 
tcaaataaaa aattacctgg aaacggtgag 
gatcaaagaa gttctgcaat acctctggtg 
gatcctgaaa ttgaaaggaa aatatttcct 
cctgtctaag gtatgcacat aaaagagtaa 
agaaacacac taactaccta atcatcaaat 
tagaccagag gtactttatt ttaagagaaa 
aaaacagaaa tagcgacctc attatcacaa 
aaattataca ttgctgtgca atgaatcttt 
gaaagatagc acaaagacag cactacaaag 
cagtaggttg agctgcatga tttgtgtcaa 
aaagggataa ggacccattt atttaagctt 
tgtaaagtta ttttatttgg ttacaataat 
gttataattt tgactttttt tcagctgcta 
gatagataga ttctgatttt ttttctaaaa 
tagaagtgat ttttcatgaa aaaagttgaa 
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caatactata tattggaaaa aaatcaaggc 213 00 

ataaaccatc gacgtgacaa tatagaagcc 21360 

tccatgaagt ctattgctac atattagtta 21420 

tctatatacc tgataacgga gatttgcaga 21480 

agttttgagt attggggacc tcataatccg 21540 

tttacgaatt gcatatagag aatttgatga 21600 

aacactagca agttttaaac caccaaatac 21660 

gtttatccct gcaggattct tttgaaatgc 21720 

tagagaacca atgttgtcgg gtgctttcat 21780 

tggtgcagga agccacactg taaagaaatc 21840 

aacatagtca atctcttgga aaaaattttc 21900 

agttgttatt gatatagctt cttctatggc 21960 

caatctttcc ctgaatccct gtataaaaag 22020 

atgattaaaa gcagaatgaa ggtttcattt 22080 

tgagctctga aattagagga tactgctcca 22140 

atactaaacc aagatcaatt cccttttgaa 22200 

atgttcacca ccaatattgg cacacactat 222 60 

acagaagcag agaaaaaaga actagaaagt 22320 

gcaagtatgc aactaatgta tgccaaatta 223 80 

gaaaacaggg tactttattt taagagaaag 22440 

ttcacccatt ggaaaagttt ttatgtcctt 22500 

cctcaaaagg aatatgaatt taaaggaaaa 22560 

ttgcaagcat tcaattaaaa tcccccacac 22620 

ttaataaaat gcaaaacaga gatatcaatt 22 680 

ttaaaaaaat attttttttt acatatttta 22740 

taaaaaatgt actttatatt ataaaaagta 22800 

ctcaaagtag cttctgaaaa taatcatata 22860 . 

aaaaacttaa acaaacacac taagaaattt 22920 

acaaatgggc tctaaaatgc tcctgaaatg 22980 
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ccaaagttaa ttgcatacaa aaaaaataat caataggtac tggcacaaga cacctagtaa 23 040 

tatgcgaaat ctcttatgtt tgtatcacca aaatggacaa tgagaggaca taacaacaac 23100 

aacaccacca aaaccttatc ccactaggaa tgagaggaca taaagggcta aaaattggaa 23160 

ggagggtcta cggggcaaga agattaacag tcaaacaaat tagtaactgt aattgttggt 2322 0 

ttacctgtgg gagggcatct cgcctccgcc cagcagcatt cataacccga gcaatctcag 23280 

cacggtcaaa gcaatttcta ctacagggtc tcgcagcaga ataccacaaa aaatcagcaa 23340 

caggaacttc tccttctctg cgaatgaatt gtctttcagg gtcaagtaat ataactgcat 23400 

ggtttttctt ttgtattttt cctgaaattc ttgctggcac tccagttcct ctagatccat 23460 

atgtaacatg gcttgcacca gtgacaacta ttaacatacc agtgacccct ccatcaagca 23520 

cattttgtaa gataatctgg gacatagaat actcatcaac tactcgagcc tgtgcagaaa 23580 

ggtatgagct tggaccaaaa ggaatagaca gattttgagt actatcaaca gaagatctgc 23 640 

gtgagataga agtaaagcca gatatgaagc ctgaaccagc tggaggtgca tatagtttac 23700 

gttcatcctt tgtaagccca cgaattcctt ctgcttggac agttcttaag atctgttgat 23760 

gagaaaagtt caagtcttaa ttgtcatctt tatgtgtctt gacctaacaa tacaagaact 23 820 

gagcctaatc ctacaaggtg gagttggcta gatggatcaa atgaaacctt ttagctctat 23880 

caaaaaccaa aattgcaata aagtttctca ggttgccctc taccctttca acagattata 23940 

ttcctttctc atatcttgac atgatcaaac tacttgaggc aacttttcat aatcatatcc 24000 

tcaatttttt tatgtttaac tttcagttta atatgggaag gaatctttga ggattatgtg 24060 

tttatttcca attcagggtc tgttctttgt agactagatt tttctagtcc taacgcaacc 24120 

aaaatcctta agggcaactt tttacagcac atagcttttg cacttattgg aagtcaattt 24180 

gacaagcacc aacaagtata tactatacta ctccctccat tccaaaataa ttgttgtcct 24240 

aaattgtttt acacagacca agaaaaaaca atagatagat gaaagagagt tgtagtttta 243 00 

caaagttaat cttatatcat cattgattca tttatagatt ttgtttccat cattaatatt 24360 

ataaggaata tacgtgaaaa aatgtaatta aatattatat tcaaaactaa aataacaatt 2442 0 

attttggaat aatatttttt tcttatacga caattataat gggacagagg gagtaacatt 24480 

tttctgttgc tcctaattat agccacacca caaccataat tttcagagac aaaataaaca 24540 

tttgaaagat caacatgagt ttggatgaaa tttatgcata ccttcagtgg tgtaccacaa 24600 

gcaacaagat gaattccatt ttcgcgacag tagctcagaa taggttcata ctcctgccat 24660 

ctttgaggcg gccaatgcaa cgtgtaagac ttcaaggtgt ctccatctat cctgccatga 2472 0 
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aagtcaagtt tcaggacaag taatgcagaa ttatggaaaa gcaatctgac taagacaaaa 24780 

gagcttcaga gattaacaga aaatagtgag ccagaaaaaa gattgcgaga cagaaattgg 24840 

tcgccaacaa aaagttgtct cttttataat ttttaattga aattttctta atttagctaa 24900 

catgactttc tacggccaca attgcgtttg cagacactta aaaaacttga tgttgcagca 24960 

aaaatcacgt tttatttatt attgatgtca attatttaac agttttatgt taggtttaat 25020 

aacagtaggt tgatgcaaga • ggctaaacat taatcagaaa ttgaaaggca gtgttattac 25080 

ttcttatcca tatactgatt gagcggttcc tgaagattag cgggaaaaac ttcaagcgcc 25140 

agagacaata gtttttcctt ctccaaacag cgcctatgca aattcttcac aatctcaagc 25200 

tccaattccc tatcgtctcg aaccggaact tgctctgctt cacctaaata caccactcga 25260 

gcattcatca acttctccca cactttccct ttctctttcc ctatcgccaa cggttctcct 25320 

atcaccgtcg cgtcgtaaat cctcgaagtt atcacttcct cctcctcctt cttcttcggc 25380 

ggctcctccg gcttcggcgc cgaagccaga ggagattccg ctgccttctc ctccgccctc 25440 

gccgtcgccg ccgagagcag gattgatgcg ccggcgacta agaacggcgc catcagcacg 25500 

ccgcgccggc tgctccgagc tcgagtatcg ccgtcgccgt cggaaccacc gggattcgag 25560 

gcggcggtga cgcgactcgc gtggcagacg gaaaggctga cgcggcggcg tttggcggtc 25620 

gagacgcggc ggaattcgag gcctccgggg gcgtcatgag gagggtccga tggcgcggca 25680 

cgtgcggcag tggcgccgcg gaagtaaggc acgtgcggga gacgagtgac gaaagaagaa 25740 

gccggagtgt ggggcttcat tagtttcgtt ggcttcagtc tggccttatc atcaaccgca 25800 

acaaggttta agttttgtca gttcactttt ttcaactgcc acacaaaaca cgaacaagga 25860 

ttcctttatt ttcagcatta caagattcca taattattat tattattatt attttggaga 25920 

aacttttttt ttttcctttc aaatatccct tttcatagac cattttaatt gacaaacaat 25980 

taaacattaa ataaaaaaac atttattatc aaacagtaga tcaatttagg tgtgttagtc 26040 

taaagaaaaa caaataaata acaatgacat gttttgagag atagacaaat attccctcta 26100 

acataacata aaacatatat atatatatat atatatatat atatatatat atatatatat 26160 

ttgtggtaga aagaaatatt agttttagtt cgacatatta taagatatta gtttctttat 26220 

gtgaaattca tgttctttta tttattatgt gtttaattta aaagataaaa ttatataatc 26280 

taaaaataaa aaagaaattt ttgaattaaa ataaaatata agagttgtag gattcatatt 26340 

attttttcct caaccaaaaa gacaccctaa aaaagttacc aatatacttt ttaacatgtt 26400 

attttaaaag tatttttatt aacttaaatt tattaaaaat tataattttt tttaatttca 26460 
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tcttttattc aatgaatttt ccctataatt tgattgtttt caataaatta taatcattat 26520 

taaaaaaata tataataaag aaagtgtatc aagaaaatat atcaagaggt tattcttatt 26580 

tttgcctctc ataaaatcct agacttaaaa tatacactct actttttgtg ttcaataata 26640 

aattcgtata tttcgtggtc gcaaaaaaat taaatgatat tgataattta catagtcacg 26700 

caaactatat atagaaggcc tcaaatttag agctctacgt gaagcattgg tgtccataga 26760 

gttacatggt gccaattacc atatctttca ttattttgga attttcatgt aagataaaat 26820 

cgatccgagt cagatgtgac tcaggtctga aacacaggtg ccgatccaat atccatgtat 2 6880 

gccaataata cgaggcgtct actctaattt gattgaaaat aagggggcaa aaagtaaaat 2 6940 

atatactgcc aaattccaat tcaattcaac tatacgacat tgtctaaaag ttagaccaaa 27000 

ttgaccaact gaagtgatac ctctttgttg atataaaaaa ctcgtgtgtg ctatgaatat 27060 

ttttaaaaca aaataatatc tatgagataa aaatatcatt attgtcaaat aatgatgaaa 27120 

tactatcatg tattttaaga agaaaaaata tataatagaa gatgacttta ttgttaatta 27180 

tgtgaattct attttattat ataaaataaa ttaaggttat gtttaaaaaa attagttgaa 27240 

agttaaaaaa caaactaatt gataaccaaa aacttttaag ttaatttatt aaattataaa 27300 

tatttgatag aattgttgtt gaagtagata aaaaatataa tatcacaaaa atagatatat 27360 

ttatatgata tttatataaa ctttaatgtt ttatggacaa aagtatattg aggtattata 27420 

attttatttt ttaattaatt ttaaactctt gtaaattatt tttcattata tcttttgttt 27480 

caattattag attttttcat gttgcatatt ctattattaa tcttgtacaa tgtcttaata 27540 

tttttaacca agtttgaaat aaagttgaaa aaacatgcac ttaagtaaat attatacttt 27600 

tattatatta attagtataa tagttaaaat aaattgtata atataaaaat attcagaaaa 27660 

taataaatta ttttaacatt ttttactgtc aatttcgtga agatgttgaa ataattacaa 27720 

tggctaagac aaatagtata atataaaaaa attgtaagag gaatgagtga aaaataaata 27780 

aataataaaa ttatgttata tttaaaagga ataataagaa tatttaataa atattttaag 27840 

aattaaaaaa taaaatataa aagcaaaaaa ttagaggcta aaaactagag ttttaaaaaa 27900 

gttactttaa ataatgtttc agaaaataat aaaagttaca aaaaaatact tatttattaa 27960 

ataattaaac aagtttttca actaataaaa aaataaaact aactaaaata acgtatcaaa 28020 

catagcctaa agcgaattta aaaaaaaaaa ttaaggtgga aaagcatcaa aattcaaagt 28080 

tggtatcaaa attaggatta actaaattta agcaataata tatgtatcct tttcctctcg 2 8140 

gccccaaagt tcatgatcaa tcttatcaaa cctttttaac ttatcaattt tgctttcatg 28200 
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caaaatctag taagagtaac ttcaaattaa 
aaatcatatc tatcttgcta tatgtaataa 
agactttcct aataaaataa aatgtgggta 
aattggaaaa ggcatgcatc attaacaatg 
tctcaagagg cagggacaaa cagatttcac 
ctcactttct catggtttag ccttaaaaca 
ggaatgtgac tgaatagttt ggtaactcag 
ctattagatc ataattttgt tgagaataca 
ttctatttct ggtttcaaat ctcaaaaaca 
aaataataat aaaaaggcaa gctagcaatc 
tggtcacaga atagcttatc cagtacaatt 
aacaacgaac ctcctttacc caatatttag 
ttgcgaatag ctcccctaat aaaaaatgcc 
gctcacttga tatcatgaaa agaaggcaag 
ggctctgtaa ataacaaaca aaaaatagta 
tgagtgtagc tgccataatt gatggcctaa 
gcattgacgt tgaacttcaa atattgcaaa 
aaaaagcata gcagaaatta agaccttaag 
catatactga ctagaaaacc catgaaaacc 
aatttatcat gaatatcctg ttactttcca 
gatccccagt gtttggtatg attacaaaaa 
tttaagatat ttttgtagtt ataacttata 
tattaagcct agatacttat gcgacccaaa 
aagatacaac gcaactcaga tgcattaaca 
gagagacata catacatact ttaaataaat 
cttacactaa tacataacta tatcagtgga 
atgataataa caaaataaaa aaacaaaaaa 
agaatatatt attttccaac atgctacaaa 
cagacatcat catttgtcct actcaaaaga 
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atccacgttt gatagtgaga ctcaagttta 2 82 60 

tcatgtttga ctgctgagtt tgatgggtca 28320 

cggatgctta gttttgatgg gtgcaaatac 28380 

ttttacacgt ctaatttctc cccctctgat 28440 

atgccctttt ctgggataca aacatggttt 28500 

tgattcatcg caatctgccc ttaccatttg 28560 

aatttgctac aatctggtga taagtaatga 28620 

atatttctat atattctaat actacagtgt 28680 

agttatacac aaattctttt gggagaaaat 28740 

aaactccgca actaaagata cataacgagg 28800 

taagaaattg gtatacaaag tatgattttc 28860 

tcacatttat ttgtaaccta ttaaaaactt 28920 

gcatgattaa tcatcaacag gaaaaggcta 28980 

acagcaataa gacggtctcc ataaccaaca 29040 

aggaattcat cacaactatg gatgactgat 29100 

aatatgttta aacattgata atttggttga 29160 

aggacggaaa tcgcaatgaa taaatcactg 29220 

taaacaatat ttttccattc agtcaatagt 29280 

gatatactgt aaattacaat gagctaaatt 29340 

tcatttagca aaaaggtata caagattcaa 29400 

agtcacatta ttttccactt tgttttctgt 29460 

agagaaaaca agaaatgttt tctcaaactc 29520 

tacgggggat acgggaaatt cttaaaattc 29580 

caaatataca cacacacaca aataaataga 29640 

gcacagtatt tattaagaga cattgattat 29700 

cgatgatcat tattcacaaa agcaatacct 29760 

acagtgacat atgtttccat tactcatacc 29820 

atcttccccc taacttgtgt caaaatgtca 29880 

agaattgact ctatatgatt gcctagtagt 29940 
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aatatttagt atttactata gctttaaaga taaagctgta ttgaatttta ttcctattca 30000 

aagtactgga gccattctca accataccaa tctacaaagt tttggaaaaa aaaatagagg 30060 

atactctttg gaattggata agtacaagag tatcatatgc gtatcggtgt tgcatacaag 3 0120 

tacagcatag atactttgtc atttttggag tatcaaggct tcacagctca aactaaacaa 30180 

atcaaaccca acatccccac gttttaatat gataacagca tgctagccat aaagcaatta 30240 

ggcatccatg atactagagt atcataaata caggtcagaa aatgctcaag tactaagtgt 30300 

tccaagatgg taaaattcat ttttaatgct tatcactgtg actacgagaa tttagaatta 30360 

ttaggattgc caaatcattt ttgccatatt ttattgctag aggcacacta ttgctttaac 30420 

tatttcaatt ttggatgaac agcacggcta tcatcactct ttctttcccc agaagccctg 3 0480 

tattacttta gtaccatgta aataaatcta tacattttgg taacaggtca tagaaattat 30540 

tatacctcca tcctcaacaa gtagcttcaa cacttctcca gccacatcag actgcaagat 30600 

ttcaaaaatt aaactagtca gaagtagtaa atatttagga aggaaccaga attacagaaa 30660 

cagaggcatc accctgatag gaagtccagt gccaaactga tccaaatacc ctatgacttg 30720 

cccttctttg attacatcac cctagaattc aattaggaaa taaatatatt gaaaagaatt 30780 

tgtagtcagt tcaatgaaag tgaggtcctc aaacaacttg atgcagcaac tgtatgatac 30840 

aaaatatatt aataactaca ccagcagaaa aatataggtc aatctatatt tgggaaccaa 30900 

ataatattta atttgtatct gatagactca agaaattata actaatttgg aagaaatgga 3 0960 

tacctagtat tattaaaaca ccaaaacact gggcagatta tagtagctaa agaggaagaa 31020 

gctaactagt caaagtgtca cactattcaa cactacaaag gaccaatccc cttttagaga 31080 

gcctgacctt tctcacccaa gagctaccca agagaataca caccctctcc tccatatccc 31140 

ctcccatata acacaatcct caccaactaa gcacctacct gacaattccc tcctaaccaa 31200 

ctctctgctc atcagggttg attctcttct ctttccaaga ctttgggctt ttgttttgac 31260 

taagccaaat ttctatctgc tggcctggtc caacagtatc ttttacagac aagtttacaa 31320 

aatattcgta tttgttagaa tttattgata ttcctattat gtccccactg tgtgcaaaca 31380 

tttagaaact aatattacaa ttaacagttt ttgtgaatgc agcaaaacta aatatatatg 31440 

atatagaaat caacaaaact gaaaaattat atgcaaagtt caattgaaaa gaaaattgat 31500 

tacccttttt gtggtaataa atataatgat aaactaggta ggttacagtt tggatttgtg 31560 

atcaattgaa gatctagatg ctaattggtc ataactacaa tattttttgc agtgcttctg 31620 

tgcctcacat cagtcacgtg tgtctatata acttgttctt aaagtaaata ttaaaataat 31680 
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ataaaaaata ttagaaattt aaattatatt 
tttatatcaa taatcaattt tttaagataa 
aaaataatta tcaatattgt ataagctgag 
ttttttatag catccagttt tttcagttaa 
tgcatgtcag caaggtcaca tcaggagact 
actgcaacta ttagtttcag cactgcaaaa 
tccaatggtg gagttcataa tgagaatcca 
tctagttcat aacaaattga tgacataaaa 
cttatcacat gtacctcttt acagatagga 
cttcggaata agccaaccta atgagaaaga 
caagtcagag ataattagtg aaagcagaag 
ccgtgggaga tgttactaag acataggtat 
ttggtgattt ctctttggaa acatttgcaa 
gtggtgatgg tggcaggcta ttgggtgctg 
gtggtgtagt cggtgaaatg ttagacaaag 
tatgcatttc aaaatctcca acctataaca 
catccagcta aaagcatgct tcttttgatg 
tattaaagtt attaggcaca atgttgtatg 
tgatcaaatt tgcattacaa gaaaaactat 
accatattct caagtttcaa ttacaatgtg 
tacctcattt aataatctat aagcctgtgt 
cataagttca atacacagta catggcaggc 
tgcaaagaac cctaccatat aacattttgc 
ccttgttatt attatattta aaatactgca 
gagaagaaga tcacctttac tttcagttca 
accaaagcct gataataagc aaaccagaca 
acatggctgt taacgaatgc aattatgcag 
aaaatacctt gtatttattt cttctaataa 
acaattgcca attcccacca aactatcatt 
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tacatttttt aaatgtatta gtaagttttg 31740 

aaatttacaa aaaagtaata cagtaataga 31800 

actatcattg ctaattatta tcagcttttc 31860 

atacttaaat ttatttcaaa agccaacata 31920 

aggccgacaa gcaagcaggc tgatacatag 31980 

tgttagtgaa caacaaacac atgcaccaaa 32040 

gatggaggga gtatcaagtg aattatgtga 32100 

acaccagaga agaggatgga tcttgacaag 32160 

ggttgcttct tccctttcac tgttctacct 32220 

aagatctgtg atagctaact ctacatagct 32280 

ttaaaacata tattattgat ggatgccata 32340 

tggtcccaga agcctccaat gctgccaatt 32400 

atgggttgtt cttttcagga gatgattttg 32460 

attcatccat aggtttactt gggataggtg 32520 

gaacctttgt tgctccaatg tttcgcttaa 32580 

atataagcca aataaggttg tgtacaaggt 32 640 

aaccagacag aaattacaga tttgaagtta 32700 

aaaacaaagc tgagctgaca caatttttta 32760 

atattcagat aggacattag gcacaatagc 32820 

ctaccatttt gataaaaaca taaataccat 32 880 

tatcttaagt gaattaaaat catcctttga 32940 

tatagacata tccagacact aattacgaat 33000 

tttagaacat acacttaaag tttttttcat 33060 

cttcaaattt tacagaagca gtaaacaaga 33120 

gcaatttcag tctcatcaca gacctctaat 33180 

taagttgtgg tttatcagat tcacattgaa 33240 

tcttggataa aaaccagcta tcataagtaa 33300 

ctaacataag gtatttgtat tcatgagcat 33360 

tttcactaat aagataagat tttcaggaaa 33420 
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ataaaaggaa agaatagcta tagacctcaa atccattagg aaaagtagca gtttgcaaag 33480 

gctttttctc caatgagcct tggggggtgt tatctgaaga agctgcaaca tcataaataa 33540 

ttgtaaggtc atggtttacg aatgttaaag taaaagttgt taaagggaag ctattctgat 33600 

agccaactat aacaaagctg tatatatcat ataaccttca aacagacatt cacaattacc 33 660 

ataataataa ttagcaacag agtagaatga tttcctacta ttgtgtatot atgtaaataa 33720 

cataatacct taaaaataaa cagataaaca ataaaaacaa catcttttgt ttgtaaggat 33780 

tttcttctat cttctttctt tccaacaagt atataacggt tcggcaacat tatctttgat 33 840 

catgtatata aaaatatgaa attgcagccc aaaaattttg caactggtta atgtatctca 33900 

aaatcttaag aataaaaaca acttaaagtt tattactaag aagataatta cactaagaag 33960 

tatgaacaaa taagagtatc agacttttct cattcaaaca acagccaaat agaacaaaaa 34020 

cataaggata taattcattt cacaatcaat ataaacccac catcagaatt ggatgtgttg 34080 

atagcttcag ctgtttttgc agatgaaact agtgtgcttt tccccttcat gtgggaatta 34140 

atgtgcttct gaccatatgc caaatgctgg atgaaaagcc tacgtttaga gttccatctg 34200 

gcattatgga tgggaagtac agcttgcttc tcaaggcagg ctcgcacatg ggacatagtg 34260 

cccatgggat ctgaaatcca agagtcatga ttaagcatct ggttgcaatg gtaaaatttg 3432 0 

cccctttaaa aatataaaaa acattgaagt attacaaaaa tattgtatta tgatttatga 34380 

aaattgaccc tttaagaagg cattaagaat ctttccaaaa agaagtaagc ctatttcaaa 34440 

aaggctcggg aaaagaacgg aacacattgt tttatttaac agaaatggat caagacaaga 34500 

aaaaaaaaaa aaccaacata acacaaatat ttctgacaag tgtttccaat caaaaatagt 34560 

atacagccta tgggtaccac caaagttaat aaataataat ttaaaaaaaa gaaaaacaca 34620 

gtcccttgtg tcctactata tgacccaaca gaatgccaat tgcatgctct taggtgattg 34680 

cagagcatcc tttggttata tatatatata tatatatata tatatatata tatatatata 34740 

tatatatata tatatatata tatatatata tttgtagaaa catgaaggat acattcaact 34800 

gccttcatga aaatgttatg ctcgtgattc tgataaatac gtgacactta gcattagcat 34860 

aaaataacct gtcaatgcta cctcattttt ccgttaattt ctttggtgtt attcttcaac 3492 0 

tgtttttcaa tttgattttt cctcatatat gtcacataaa ttaaagcaaa taaaaccgaa 34980 

aagcaagaga gcaagatcag aagtcgcaaa cacacgagca gagatggcag tcggcaaagc 35040 

acgttcataa caaaaaaaaa atgcaggtag agatgaggag agagagagag ttacagtgaa 35100 

aggaacgaat ggcaggcgag gattccatgg gaagaaatgg aaatggaaga atgggaggga 35160 
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aaaacaatgg aggaggagag gaacttatag agaagagaga atagccaagt tgagttaagc 35220 

gaatgaggaa agaggtcgat aaattagttg cactgtgtct gctttgagat ttccgcccct 35280 

ctaatcacct tctccgtttc aatctaggaa cattgcctcg ctaacgtgcg ccggtgtgtg 35340 

actagtgctt ccctcctccc tcttacagtc ttacgtggga cccacccctc caggcaggta 35400 

ggtttcatgg acagccatac aatgaatagt tcaaaaagtc taatttagta gtttcttgtt 35460 

actataattt ttttatgcag tccacactaa taaaaaatta gatggttgga aaacaaatct 35520 

tattacaagt tttataggta aacttgaaaa actctatgtt ataagacctt tttctcactt 35580 

tggtagtagt ctcttattca agttagataa ttcttcttat cttaataata atattttttt 35640 

tatagtgata catggatgtt atttagtggg attttattat ccctctctcc acctactctt 35700 

tcattatagt aatgcattct tcaaagagtc aaaatatatt tcattacttc caagaataaa 35760 

ccttttaatt ttggatagat ttatttttta gtcttttaat ttatttattt tttagattta 35820 

atttggtcct tcagtttttc agaattcaat ttaattctct aattttttaa atcgatcaaa 35880 

tttggttttt caatctaaat tataagaaac tatattttgt gatggtttaa aatcgccatt 3 5940 

aagtgttctt aagctaccac aaaaagcaca tttccaaaaa aataaattga ttttaaaaat 3 6000 

tataagatca aattgaatca attttaaaaa ttaaaatatt aaattgaaaa aaaaataaat 36060 

gatcaaattg aacataaata ataaatttga ggattaaaaa actaatttaa cctttaattt 36120 

tttctcactt atattaatat taaaaaatta tattgatttt cctaataact ccttatctca 36180 

attaaaattt ccaaaaatta attctagcat cttcaaacac tactcaccat gaaagttcat 3 6240 

cacaaccatc tttctttctc ttttctctac atcatgtttt cgcttcgcaa actttattgt 36300 

gttcctagtc ttagacgtct gataatcttc cacaagtatt gaactataac acttattgta 3 63 60 

cttgcaccgt taatagctaa caccaaatga gacgtgtcac ttgactttta tatcactaag 3 642 0 

aaaatttcaa cacattgatc cagtattagc tccatcttgc tttaacactt gtttgactag 3 6480 

tcacttaagt gcaacaacca actttgatat cattgttgga aaaataaacc ttattagaag 36540 

tttcctagac aaacacgaga aactctttcc attacaagac tttctctatt acttgggtat 36600 

ggtggtgact tcctttataa tggtggtgaa tagctccatt tataaatgtt atttagtgag 36660 

ttttaattat atcatctctc tatccacatt ttcattacgc tagtaggatt ctccaaaaat 36720 

caagttacat tccattttac gtcacctctt aattttttgc tcagttgctt taatatttgg 36780 

aaatttggat tggttttcac aacatatact agatataact tttaatgtaa ttcaaaataa 3 6840 

caattcttga taaattgatt ttcacaacat attcatatat actcaaaatt taaagataag 3 6900 



95 



WO 01/51627 PCT7US01/00552 

tactacattt tatcaggtgg gtcaacacat tttacctccc cgatcataga gtgatcaaga 36960 

ggaaaaaaag aatgaatgaa aggaggagta aggtgaagga aagtaatgaa aagaatgaga 37020 

caacttttaa aaaattaaaa ttaaggataa taaatttatt tattgaaata agggtttatt 37080 

ttaatcaaat aaccaacttt tttgtttttt tagtatgttt gtctaaatta ttatttttaa 37140 

aaaataactt tctgtttatt ttaagaaaca aaacaaatct tatttgcttt tttaaaaaat 37200 

acttatataa aaatatttat tttttttttt ttttaagttt aaacaaactc atcctaatat 372 60 

gaaccagaaa accttagttt ttgttaacaa aaatgagtta aaataatttt ggattcattt 3732 0 

taaaactatt tctttgtctt tttagtcatt caaatgattt attgataaaa aaatattcaa 37380 

atatttttgt tttcaaattt aagtgaggag tgataaaaac acttttttta ttggatagaa 37440 

tttattaaaa tttacaaaaa tcatgagtga agttagaatg atacatacat attttgtcat 37500 

ttccaataat htttagtgaa aataaattgt attaaaaagt gtgttgctat ttttagcact 37560 

gagagtcatg aacatggatt tgctccagga gtgataattt gtggaatcaa gtgagggaga 37620 

aactcatttt tcaatttaac tttaaaaacc aaaactaaaa aacttacaac tatacattgt 37680 

attaattagc atgtgtttta tatatatata tatatatata tatatatata tatatatata 37740 

tatatttgag tatggaagga gtactctatt caatgagatg aatatgtgtt aacaaaaaga 37 800 

ttgattaggc gattaagaaa gaagagagat tcaattcctc ttactactaa aatctaataa 37860 

actgataatt aacatttgtt aataataaaa aaatgaaatg gacatgcaat taattaggcc 37920 

aatgataaag aatatattta aaacaaattg ttaaatacag tgtgtttgac aatgatatat 37980 

aatcgtgtcc atggatcata tcaacctgaa actaatgaaa ggatcacaga tcacttctat 3 8040 

cttccaatta aggaatcaca ggtttaaaca atagtagtag ttaatttgca catcactact 38100 

ctggaggcaa ggctaagcaa cgtcgagatg gactatttct cgaaactcca acctcctagt 38160 

ttcactgtag aatgtcacac attttgttag accaatatgt tagccatatc caacccctta 38220 

tcttccattc cgttgttttc cctatggctc ctttgttcac tttccaccac actttttttc 38280 

catattcatc acctgcgaat ataacccatc cttccttgtt gaaaggtcag taagcactgg 38340 

agtatcgatt tttgaaccat gactttgggt taagcattgc aaacttcgaa ggctttgatg 38400 

ataaaacttc tttgccatca acgctgtcat gaagaaacca attcaaattt gcactgtatc 3 8460 

cataaactaa cggttgagga ctctctccag ctgacacctt aagtgccaag ggattcgatg 3 852 0 

cctcaaatga agcttctacg cttttgtcaa caccaatttc ttttcgtggg ttctctgagg 38580 

atttgccaac tgataggctc aacacttggt cttggattgt gggtgtaagt tgcaatgata 3 8640 
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ttctggaagg aaaatgtttt tcagctgcac cagcccctcg ttgtttcata aggaactcat 38700 

tcaccaattc gataacatca cacctgtcca catggaatat tgtaaattat attggagatt 38760 

aagaatttac gttaaaaatt aagagtgaat ttgtttaaac ttataaaaat taatttaaaa 38820 

atagatattt ttttatataa aggttttttt aagaaataaa taagattttt ttatatatta 3 8880 

aaataaaaaa attatttgtt taagaaaaaa taatttataa aaatgtttta aaaaataaaa 38940 

aattacttat tttattaaaa tcaatattta ttttaacaaa taactttaaa taaattgacc 39000 

ctaaattgta cacaattaat taagaaaaac agttactcaa catatatata tatatatata 39060 

tatatattac aatcttattt ccttaggtga gatgaattag atagatcaca tatcatcatg 39120 

actcgattaa taatctcctt aatacagtga agtaaatata taaagataat gcgattcata 39180 

aaaagagtga ttttgatcac ctttgttata aaaagaaaaa gaacgatgct tttttttttt 39240 

ttgtaaatat aaaatttaag tgacttttag atatttccaa tctaaatcca aaatattgag 393 00 

ctataagaaa ttgagtttaa gttgaacaaa agtatatcat agaaggaaga atttagagaa 39360 

aagaaaagaa aaaagtacat accacacctt gtaaacaatt caagctaatg agcccaaaat 39420 

tgtacaaatg atgaattttt ttttcctaaa cttctaataa caacatgggc ataagctgaa 39480 

taatggataa tttatttgtc ttgttaagat gcttatttat tggctctata tgtattatct 39540 

cttaatggga tattaatggg ataatattag tggttaatta attagtggtt accttacgtt 39600 

gtcaatgtcc accatgattt cagcccactc gtctttgatt aagactttgt gattgaattg 39660 

aagaacgcct tccacatggt gatatctgag ggagaattga agcctctcgc tagctggact 3972 0 

gctttttgaa ttctcaacat ccatgaccat gaggtgacaa tacagctgca ccttccacaa 39780 

cccaattgtt gaaaatgcat aagaaaacaa aggacacggt gttctgaagg ggttgttatg 39840 

tgcttccaag ttccccaacc cagttactga ccgcaaggtt caatgaacgc atccacagct 39900 

cttccaagtt agaccctaag agtttcataa tcatatgtga tgcttgtcta gactgaaaac 39960 

ctatcaagtg atctttcaag ttactaatgc acccagaacg aaaatctgca gcaggagctt 4002 0 

catatatgta aactaggaac aagagggtga agaaagaacg tctgaaaggt tgaaaacgtc 40080 

tgaaaggttg tcggaagcag tggagtcaag gtttggaaat ctgatgaagg tagtgctgtt 40140 

tttgttggag ccataaagaa ggatggcctg aatgaaatta gcaaagacat tgtaaatggt 40200 

ttcttcatca tctattaact tattattggt gtttttagta gtgctgctaa ttggcttgaa 402 60 

aggttttgag gtctaaagat ggatgggaat gttgcagtct gcaataatgc caagatagag 40320 

gtttgaggaa tgactattct ttttgtgtac ggtgagatta agacgtggtt ggcatgagct 403 80 



97 



WO 01/51627 PCT7US01/00552 

tgaagaagaa catatgttta aggacatgga acttgtttcc cattctgaaa ttggtggaag 40440 

gttctgaatc cagcaaaaca catcaagaaa gttgttagcc atggatcgaa tgaagcaact 40500 

taattaatat aactctctct ctctctatct ctctaattcg gttgcattca ggtgtggctt 40560 

cacatttatt tgtagactct tacataatgc tatgttatgt actgcaatta gcaaatactc 4062 0 

tttctagtgg agaaataata attaaaaaag tggactgatt ggtacgacca ttagtttaat 40680 

tagctccatg gagaaaagca agataaaatt gctaattatt ggttaagaaa ataattgcac 40740 

cagatatatt atataaaatg tcaaaaacgc attccgtaca ttataaataa tattatatac 40800 

gtcatattta catcattttt tatccttgtt tatctcaaaa aagtgtaaat atagagagag 40860 

tatatatcat atcatataat atgtaagttt ttattagttt aaaaaaatag cttgagagta 40920 

atgtgatttg tcatgtgcta ataaaatatc attttgaatg ctcttttatc cacatatatt 40980 

aattgttaat gattgaagtt tattattatt attataatat ccttttaacg atgaaagttt 41040 

gttttaaaaa aatatagatt taagatgtgt ttggaggaat ttatttatat cttatctgaa 41100 

cttattttat ggcatacgtg taagtattta agaaaactta taaaattata gtttatgatt 41160 

tatttataaa ttgttttcaa cttattttaa taaaattttc aaaataactt ataagaacaa 4122 0 

attaaatttt ttatatgaaa ataatttaac cttattttct tttcaattat aaaaaacaat 41280 

ttacaaataa aagcttatat atatgataca cacttttaag -tgtttaagta agctatctaa 41340 

aaaaggccgt acagtgtttc tttaatgaac tatcgatcgg gaatgttata tatggaaata 41400 

tatatacttg agtgaatata ggctcgatta ctccatagta cagtccaata attattagta 41460 

aacgaattat acgtttaatt tgtatctata tatcttttgt tgataattga tgtaatttca 41520 

attttaattt accaaagaga gttagcacca cagcgagcat ccgttgcctc attagtcatt 41580 

agtacttatc accgacatct ttttgtttgt aaaaggacca ctgattcatt tacctacata 41640 

tataatatac aatatgtatg tatacaaaaa tcatagtaag gtttaaatgt aatgcttcat 41700 

gaataagata ttctgtgtta cagattaaga ttcgtgtatg ataaaatgtt tgttattatt 41760 

agagttaacc ggcaatttgt tcatattgag tctcattaat taccttcttt tcgcatgttt 41820 

tgttgacatc gagagtgacg atcctaccga gatagataag gatatatagt gataacaaat 41880 

tgagataaaa agctctttgc acagtcaatt atgattaaga aaaatatcaa atcagtttta 41940 

cagaccctag ctcattaggc agagataatt acatgcacgt aaagaaaaaa ttattgagtc 42000 

actaaaattg ggatagcgag gaatttgagt aatttgaact aagtcataag tttaaatcgt 42060 

atcgttaaaa aaaatgtagt ttttgttact cttttaaatg ctagtatttt tattttgaaa 42120 
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ggttttaaat accagtatta ttccactaat aacctgcctt tatttcttta tataaagcct 42180 

tctcttgatg aaaatagaat actaattaaa taatcgagaa aaaaaagata caaatggaga 42240 

acaaattatc atgaaaaaat tacacattag aaaatataca tgttttagca ttgaaaaata 42300 

caatggtcaa ttataaacca aagaggccta actaagtcta atgtttacgc caccaaattt 42360 

ttggttgata acgtttaaaa gtaatagcta gatggtctct ttcaaagaaa tttctgtcca 42420 

tattattcag gtttcaaatt ttgtttgtaa gacgaggaat ttgggatctt gatgataaga 42480 

ataagacagg gtgaataagt tcatttaatt aagatggaac gtgcgagttt aacttgagtt 42540 

acgtgtaagg tttcataatc aagtgtacat atgtatatgt attagggtag attaatgata 42600 

ttagctatca aatttaataa aatgtatatt taatattatt tttttatcaa cagtaaattt 42660 

tgttaattta acagttgaat ttaaagtttt cataaaataa attaaacccc acattatttc 42720 

aaaaagtaat taatactttg ttactacact cttaattata tgcataatgc atcatatttt 42780 

gtaataaaaa ctttatattt acacacgtat gaccattgat taacctacac tgtggcaagt 42 840 

acaccctcat tttctaacat ttataaataa aagttttttt aaacagaaaa ttatgataaa 42900 

atcttataat tttatatttt atttcattta tatttatata tttattataa attcctatat 42960 

ttatatattt gaacccactc attttacttt ttataattta tccacattga ttcaagttcc 43 020 

aaatctacac ccatcgagtg cataaattaa ttggaatatg ttttaactta atcaaaggtc 43080 

ttgagttcaa attttgaata tggagttact tacatattta aaggaagagt ttgttatcca 43140 

taatggttcc ataagactct cgaacaaaat tacttccaat aaaatataca tgtggtttat 43200 

aaaaaacaat tccatcaaaa ttttccaaaa ataatacaaa aaggatacaa atattttttt 43260 

taaaaaaata attcatttat tttgaataca tgacttttat atatatatat atatatatat 43320 

atatatatat atcaaccggg acatagtaat tcaagactac ttaatgttgt tcacccgtga 43380 

tacatgttaa ttcaatatta cacaatcatt atcaaattta attttagaaa atttaatatt 43440 

tttctccatt agcatatagt catttttatt ggaaaataaa tttgatgaaa catatcatac 43500 

taattaaagg ataaacatta taatttataa aagcattcaa ctatatccat taattgtaaa 43 560 

gaaaattttc aattgagaat cgaagttaat aattatcaaa ataattcttg cttttattta 43 620 

tgaaaatata ttgtgtgatt cttaattatt ttcgtaaata tataaaaatg aatatcatca 43680 

tatattttga agtaacttaa aatatattta atcctaaggt tctacatgct tgaacaaacg 43740 

tcttcatcac aaatctttgt agaaaaagta aataagacac taccaaaaaa aaaaaaaaat 43800 

caccaccact acaaataaaa aaggtacgca aaaagagagg ttacactatt accaccctac 43 860 
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acactgtctt ttatccacat attccttctc aatcggtaaa agaaccaata gctatgatag 43920 

acatccccgg ccggactcga tattttttca aatgttccct caaatcactg ttagttttga 43980 

tgttaaaaca atttgtttct tggttttgct agtgaaccgc ttgatttcat atagcaaaat 44040 

aagttttttt tttttttttt gtaggctaga aaaataagtt gcagtagata aaaataaaga 44100 

caaagcattc tgatcgctat aattgtaacc aatgtgcaat attaaagagg tgtctgagag 44160 

catacaatat cattttgtag tcttccaatt tgggttattt ctagtgtaca cacctcaacc 44220 

ttctgtgatg acagcctttt atacccattt cacttaattt gcccatgttc tctgtccact 442 80 

cgtttgatgt cttctaagta ataactatca gtttcattga ccttgtggtc ataactcata 44340 

tctaccatcc ttgagctaac acaaagaata aagagatatt taggaagata aaattgtgcg 44400 

aaagtaagaa aoattcaatt gtaatatgct tcaacaatag tatggccaac agtagtggcg 44460 

aatctaagac tctgactaag cagccataaa ttaaagaagc ttatttacaa ctagtgttat 44520 

cggagaatga aaaattgaag aataataagt tcagctataa taaactcgag ggaggaaaaa 44580 

caatgaaatt caagataaat agatataact tattaaattt aaggggtgta tttgcacacc 44640 

ctgaattata gagattctta tatctttgag aaaataatta aattgggaaa aaagagataa 44700 

tgactgattg agatttgcct cagaattgtt cgttttaata ttggtacgaa tctaatggtt 44760 

ttatcctgaa agatgctcac aagaattgag ggactaataa attgtttata aactactact 44820 

aaatgagatg agactttaag gtgtactgaa gcaatgtcat ttaaaaaatg actactcgta 44880 

tttgtgttga gaaaatttat tttcaaagaa aagaaaatat atacatataa gataaagtaa 44940 

ttaacataac gaaaggaaat aaaatgcaac attataaaaa ctacaactat ataaatggta 45000 

tatacaactc ctagcacatg cattggattg tgaattaatt aaaatgttgt atggatggta 45060 

aaaattcaaa actaaacccc acacaattta gtgacacaga atataattag cgttgttctt 45120 

tttacagaaa acgacgagaa caaaggtgtc aaaggaaagg ag atg gat gca tgt 45174 



Met Asp 
1 



Ala Cys 



ggt atg age tea tec aat tec aaa cat gtt gtg gac caa aag cga agt 
Gly Met Ser Ser Ser Asn Ser Lys His Val Val Asp Gin Lys Arg Ser 
5 10 15 20 



45222 



ace atg aac atg atg ate acg acg att ctt etc aga ttt tgg gac cgc 
Thr Met Asn Met Met lie Thr Thr He Leu Leu Arg Phe Trp Asp Arg 
25 30 35 



45270 



tat gat atg aat tgc gac tac act act aac tct tac gag ccg gg 
Tyr Asp Met Asn Cys Asp Tyr Thr Thr Asn Ser Tyr Glu Pro Gly 
40 45 50 



45314 
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gtcatcataa aaccattacc atttaccact cttttgaacg ttaatgtagc ctaaatctta 45374 

tatccagaga accagaccct gtttaaattt cctttttaaa atgtttctga taaatttctc 45434 

ttgcaagtgt ctcag a acc cag tta get cct tec tea cca cgt gac act 45483 
Thr Gin Leu Ala Pro Ser Ser Pro Arg Asp Thr 
55 60 

tea gtg aaa ctt gga gat gee age ag gtttatttca gecagggtet 45529 
Ser Val Lys Leu Gly Asp Ala Ser Ser 
65 70 

ttgtctctca gggcaattca ttaatttaaa aaataacatt tttttataca tattcatcag 45589 

tgcacgagga ggagggatag tatgtatcac actttttaat tcactttcta ttgttttctg 45649 

ttagttgaaa ttcaaatatc cctcactaat ttgagactga aacatttcac caaaaaaaaa 45709 

aaaaattgag gatggaactt tcttttttag ttgatcataa attttttctt ctaaaatata 45769 

taatgtggat acatattttt tgagattgaa acctaacaaa tgataaataa gactcactta 45829 

tttagtgaga catacatgaa tttcagagaa tattttccta tataggttat tagcatttct 45889 

tttaatattt tttttattat tgtcttattt ttaaaaagtt ggcattcttt ttaaaattga 45949 

cttttttgag atattgaact attttaataa taataataaa attaagttat atagtgtatt 46009 

aaaaagaata agataaaatg tgttttaaat ttctcaagac tttagtcaaa attagtttca 46069 

gtctcctcta ttaaaaatgt gttttaattc tcatattttt aaaaatatgg tgaatttcat 46129 

ttttaatctt gaacagttct ttaattttga cttaattaaa ttcaacatat ttcagaaaca 46189 

egggaaccaa aaccaccatt tttagaatcc aagactataa gatcttaatg aegtaaaaca 46249 

caatttaccc gtgagaatat taaagctagt agtattgett ttcagtgtgt ttcctacggc 463 09 

acattgttgt gtgtggaagt ggaagctaga aaacaaaggc agcagaagaa gtatggtcct 463 69 

acaaagtgtg tagtagtgaa gaagaaatag ccgttggtgg tggagaggcg egggtttgea 46429 

ataaaagaac agcgcgccat gatcctataa taaaccctgt caacaaaaac aagtatgett 46489 

catgaatagt tactatttac aaggaaaact ageegttact cactttttct tctttttttt 46549 

ttttgtaaca aattctgaac cctgcatgtt cattctctct ctctcacgct cgcaacccgc 46609 

gcgcgcacct acacttcttt tatgtcatca cgtgctcctt ctcactctcc ctctctctca 46669 

ctacaaaaac cattcttcaa cttgcaacac acgcacacac acacactcac acacactgtt 46729 

tttttgttcc actaaatcaa aacctcttat ctcttactct cattacattc attcttttga 46789 

ttttcgttat ggtagtagca gtggagaaaa ccaacctcac ttcacaatca caatgettea 46849 

accgtgtttc tgacaagaag aaagaaagat gcaagacaca catgaacaac gttaacccat 46909 
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gttgtttttt gtttctctta tgtgtgtgga g.c ctt gtt gtg etc ccc tea tgc 46962 
Leu Val Val Leu Pro Ser Cys 
75 

gtg agg cca gtt ttg tgt gaa gat gaa ggt tgg gat gga gtg gtt gtg 47010 
Val Arg Pro Val Leu Cys Glu Asp Glu Gly Trp Asp Gly Val Val Val 
80 85 90 

aca gca tea aac etc tta gca ctt gaa get ttc aag caa gag ttg gtt 47058 
Thr Ala Ser Asn Leu Leu Ala Leu Glu Ala Phe Lys Gin Glu Leu Val 
95 100 105 110 

gat cca gaa ggg ttc ttg egg age tgg aat gac agt ggc tat gga get 47106 
Asp Pro Glu Gly Phe Leu Arg Ser Trp Asn Asp Ser Gly Tyr Gly Ala 
115 120 125 

tgt tec gga ggt tgg gtt gga ate aag tgt get aag gga cag gtt att 47154 
Cys Ser Gly Gly Trp Val Gly lie Lys Cys Ala Lys Gly Gin Val lie 
130 135 140 

gtg ate cag ctt cct tgg aag ggt ttg agg ggt cga ate ace gac aaa 47202 
Val He Gin Leu Pro Trp Lys Gly Leu Arg Gly Arg He Thr Asp Lys 
145 150 155 

att ggt caa ctt caa ggc etc agg aag ctt agt ctt cat gat aac caa 47250 
He Gly Gin Leu Gin Gly Leu Arg Lys Leu Ser Leu His Asp Asn Gin 
160 165 170 

att ggt ggt tea ate cct tea act ttg gga ctt ctt ccc aac ctt aga 47298 
He Gly Gly Ser He Pro Ser Thr Leu Gly Leu Leu Pro Asn Leu Arg 
175 180 185 190 

ggg gtt cag tta ttc aac aat agg ctt aca ggt tec ata cct ctt tct 47346 
Gly Val Gin Leu Phe Asn Asn Arg Leu Thr Gly Ser He Pro Leu Ser 
195 200 205 

tta ggt ttc tgc cct ttg ctt cag tct ctt gac etc age aac aac ttg 47394 
Leu Gly Phe Cys Pro Leu Leu Gin Ser Leu Asp Leu Ser Asn Asn Leu 
210 215 220 

etc aca gga gca ate cct tat agt ctt get aat tec act aag ctt tat 47442 
Leu Thr Gly Ala He Pro Tyr Ser Leu Ala Asn Ser Thr Lys Leu Tyr 
225 230 235 

tgg ctt aac ttg agt ttc aac tec ttc tct ggt cct tta cca get age 47490 
Trp Leu Asn Leu Ser Phe Asn Ser Phe Ser Gly Pro Leu Pro Ala Ser 
240 245 250 

eta act cac tea ttt tct etc act ttt ctt tct ctt caa aat aac aat 47538 
Leu Thr His Ser Phe Ser Leu Thr Phe Leu Ser Leu Gin Asn Asn Asn ' 
255 260 265 270 

ctt tct ggc tec ctt cct aac tct tgg ggt ggg aat tec aag aat ggc 47586 
Leu Ser Gly Ser Leu Pro Asn Ser Trp Gly Gly Asn Ser Lys Asn Gly 
275 280 285 

ttc ttt agg ctt caa aat ttg ate eta gat cat aac ttt ttc act ggt 47634 
Phe Phe Arg Leu Gin Asn Leu He Leu Asp His Asn Phe Phe Thr Gly 
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gac gtt cct get tct ttg ggt age tta aga gag etc aat gag att tec 
Asp Val Pro Ala Ser Leu Gly Ser Leu Arg Glu Leu Asn Glu lie Ser 
305 310 315 

ctt agt cat aat aag ttt agt gga get ata cca aat gaa ata gga ace 
Leu Ser His Asn Lys Phe Ser Gly Ala lie Pro Asn Glu lie Gly Thr 
320 325 330 

ctt tct agg ctt aag aca ctt gac att tct aat aat gec ttg aat ggg 
Leu Ser Arg Leu Lys Thr Leu Asp lie Ser Asn Asn Ala Leu Asn Gly 
335 340 345 350 

aac ttg cct get acc eta tct aat tta tec tea ctt aca ctg ctg aat 
Asn Leu Pro Ala Thr Leu Ser Asn Leu Ser Ser Leu Thr Leu Leu Asn 
355 360 365 

gca gag aac aac etc ctt gac aat caa att cct caa agt tta ggt aga 
Ala Glu Asn Asn Leu Leu Asp Asn Gin lie Pro Gin Ser Leu Gly Arg 
370 375 380 

ttg cgt aat ctt tct gtt ctg att ttg agt aga aac caa ttt agt gga 
Leu Arg Asn Leu Ser Val Leu He Leu Ser Arg Asn Gin Phe Ser Gly 
385 390 395 

cat att cct tea agt att gca aac att tec teg ctt agg cag ctt gat 
His He Pro Ser Ser He Ala Asn He Ser Ser Leu Arg Gin Leu Asp 
400 405 410 

ttg tea ctg aat aat ttc agt gga gaa att cca gtc tec ttt gac agt 
Leu Ser Leu Asn Asn Phe Ser Gly Glu He Pro Val Ser Phe Asp Ser 
415 420 425 430 

cag cgc agt eta aat etc ttc aat gtt tec tac aat age etc tea ggt 
Gin Arg Ser Leu Asn Leu Phe Asn Val Ser Tyr Asn Ser Leu Ser Gly 
435 440 445 

tct gtc ccc cct ctg ctt gee aag aaa ttt aac tea age tea ttt gtg 
Ser Val Pro Pro Leu Leu Ala Lys Lys Phe Asn Ser Ser Ser Phe Val 
450 455 460 

gga aat att caa eta tgt gga tac age cct tea acc cca tgt ctt tec 
Gly Asn He Gin Leu Cys Gly Tyr Ser Pro Ser Thr Pro Cys Leu Ser 
465 470 475 

caa get cca tea caa gga gtc att gee cca cct cct gaa gtg tea aaa 
Gin Ala Pro Ser Gin Gly Val He Ala Pro Pro Pro Glu Val Ser Lys 
480 485 490 

cat cac cat cat agg aag eta age acc aaa gac ata att etc ata gta 
His His His His Arg Lys Leu Ser Thr Lys Asp He He Leu He Val 
495 500 505 510 

gca gga gtt etc etc gta gtc ctg att ata ctt tgt tgt gtc ctg ctt 
Ala Gly Val Leu Leu Val Val Leu He He Leu Cys Cys Val Leu Leu 
515 520 525 
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ttc tgc ctg ate aga aag aga tea aca tct aag gec ggg aac ggc caa 48354 
Phe Cys Leu lie Arg Lys Arg Ser Thr Ser Lys Ala Gly Asn Gly Gin 
530 535 540 

gec acc gag ggt aga gcg gec act atg agg aca gaa aaa gga gtc cct 48402 
Ala Thr Glu Gly Arg Ala Ala Thr Met Arg Thr Glu Lys Gly Val Pro 
545 550 555 

cca gtt get ggt ggt gat gtt gaa gca ggt ggg gag get gga ggg aaa 48450 
Pro Val Ala Gly Gly Asp Val Glu Ala Gly Gly Glu Ala Gly Gly Lys 
560 565 570 

eta gtc cat ttt gat gga cca atg get ttt aca get gat gat etc ttg 4849 8 

Leu Val His Phe Asp Gly Pro Met Ala Phe Thr Ala Asp Asp Leu Leu 
575 580 585 590 

tgt gca aca get gag ate atg gga aag age acc tat gga act gtt tat 48546 
Cys Ala Thr Ala Glu lie Met Gly Lys Ser Thr Tyr Gly Thr Val Tyr 
595 600 605 

aag get att ttg gag gat gga agt caa gtt gca gta aag aga ttg agg 48594 
Lys Ala He Leu Glu Asp Gly Ser Gin Val Ala Val Lys Arg Leu Arg 
610 615 620 

gaa aag ate act aaa ggt cat aga gaa ttt gaa tea gaa gtc agt gtt 48642 
Glu Lys He Thr Lys Gly His Arg Glu Phe Glu Ser Glu Val Ser Val 
625 630 635 

eta gga aaa att aga cac ccc aat gtt ttg get ctg aga gee tat tac 48690 
Leu Gly Lys He Arg His Pro Asn Val Leu Ala Leu Arg Ala Tyr Tyr 
640 645 650 

ttg gga ccc aaa ggg gaa aag ctt ctg gtt ttt gat tac atg tct aaa 4873 8 

Leu Gly Pro Lys Gly Glu Lys Leu Leu Val Phe Asp Tyr Met Ser Lys 
655 660 665 670 

gga agt ctt get tct ttc eta cat g gtaagtttcg tgtgctgttc 48783 
Gly Ser Leu Ala Ser Phe Leu His 
675 

tttcattaag tgttgtgtgt gctgttcttt aattataatt tggagtttta ccttagtaat 48843 

ctgtataatt etaateggag aacagtacaa acaaaaacac ctaaggaaca ctatagcacc 48903 

taaggaacaa caccttagct ttaatatacc atatcaataa gtgaattatt ttcttgttca 48963 

tettgatgea g gt ggt gga act gaa aca ttc att gat tgg cca aca aga 49012 
Gly Gly Gly Thr Glu Thr Phe He Asp Trp Pro Thr Arg 
680 685 690 

atg aaa ata gca caa gac ttg gec cgt ggc ttg ttc tgc ctt cat tec 49060 
Met Lys He Ala Gin Asp Leu Ala Arg Gly Leu Phe Cys Leu His Ser 
695 700 705 

cag gag aac ate ata cat ggg aac etc aca tec age aat gtg ttg ctt 49108 
Gin Glu Asn He He His Gly Asn Leu Thr Ser Ser Asn Val Leu Leu 
710 715 720 
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gat gag aat aca aat get aaa att gca gat ttt ggt ctt tct egg ttg 49156 
Asp Glu Asn Thr Asn Ala Lys He Ala Asp Phe Gly Leu Ser Arg Leu 
725 730 735 

atg tea act get get aat tec aac gtg ata get aca get gga gca ttg 49204 
Met Ser Thr Ala Ala Asn Ser Asn Val He Ala Thr Ala Gly Ala Leu 
740 745 750 755 

gga tac egg gca cct gag etc tea aag etc aag aaa gca aac act aaa 49252 
Gly Tyr Arg Ala Pro Glu Leu Ser Lys Leu Lys Lys Ala Asn Thr Lys 
760 765 770 

act gat ate tac agt ctt ggt gtt ate ttg tta gaa etc eta acg agg 49300 
Thr Asp He Tyr Ser Leu Gly Val He Leu Leu Glu Leu Leu Thr Arg 
775 780 785 

aaa tea cct ggg gtg tct atg aat gga eta gat ttg cct cag tgg gtt 49348 
Lys Ser Pro Gly Val Ser Met Asn Gly Leu Asp Leu Pro Gin Trp Val 
790 795 800 

gee tea gtt gtc aaa gag gag tgg aca aat gag gtt ttt gat gca gac 49396 
Ala Ser Val Val Lys Glu Glu Trp Thr Asn Glu Val Phe Asp Ala Asp 
805 810 815 

ttg atg aga gat gca tec aca gtt ggc gac gag ttg eta aac acg ttg 49444 
Leu Met Arg Asp Ala Ser Thr Val Gly Asp Glu Leu Leu Asn Thr Leu 
820 825 830 835 

aag etc get ttg cac tgt gtt gat cct tct cca tea gca cga cca gaa 49492 
Lys Leu Ala Leu His Cys Val Asp Pro Ser Pro Ser Ala Arg Pro Glu 
840 845 850 

gtt cat caa gtt etc cag cag ctg gaa gag att aga cca gag aga tea 49540 
Val His Gin Val Leu Gin Gin Leu Glu Glu He Arg Pro Glu Arg Ser 
855 860 865 

gtc aca gee agt ccc ggg gac gat ate gta tag cacaaatttt gcattgattt 49593 
Val Thr Ala Ser Pro Gly Asp Asp He Val 
870 875 

ttttgtgcca aatgtagtag gectactata tatatgttct atgattcttt cattcttata 49653 

ttatttttgc ctgtttgaat gcttgaattt gtacatactc atactacaat aaggtgtagt 49713 

tctggttaat tttacctcta cctcaaagct ggggtgtaat tctgtttcct ccaaggcaca 49773 

taatagttga aaatagttct caggagcatt cattgtttat tetgeaagat tctctttcac 49833 

ggctgetate ttctatgeat gccctgccca taaatgeatt atgaagaatt gtaacggctg 49893 

tgtttttgga cttcttcaaa aagtttatgt tattgecagg tgtatatatc aacatgtttt 49953 

aaagattttc aaacaatcag gttttagatg tgggtttgca tgcatgagat tggactagtg 50013 

cgcttgatgt agtataaaat ataaattgtc caatcagcac cctctacatg tccaaataat 50073 

gggccttatg aaacttaatt ttttaattac aaactacagt aatctttttg aataaagatt 50133 
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tacaaattac aacagacatg tgaagtcgtc atctttcatt gccaattctt tcaagtttac 50193 

tactattatt ttcctgcaag cattccacat tcacatctga taactatgac agcatcttcc 50253 

aagataatga cttccaagtt ccaacactgg ctctgtacat ttgaactaat tttatatcat 50313 

ttatctattg tgattgaaat ataaaattga agtgatgtga acaatacaaa tcacatcttg 50373 

aattaaaata tctaacaact ggaacaaata agaggcccag aaaaaaggga taaataacgg 50433 

ataacaagac agaaagaaaa aaaacccaac ataattccaa cttcaaaatt cactcaataa 50493 

aaagtttaac atgtaaattt acttggaaac aaaactcata accaataata ataataataa 50553 

aagaaatcag ttttatagca ttaatttggg atgctctgct tgtatgcaaa tggcacaacc 50613 

ttaccctcaa gattgcaaaa cacagatgag taacagatgc aatgtgaatc aataaaaagt 50673 

attgttgcgt tgttgatgac acaaccttac tcataaaaaa tgcattgttg atggctagca 50733 

ttgttgcaag gtattcatac agtttattct gcaacataga gaaaatacaa ctcatcaaca 50793 

ccaggaaatg gttctgttca aaatcacgga ttataaaaag ttattatcta aatgttacaa 50853 

gcttaagaag atctatcatt gtgaaagtct cttatgcaca ttaatattac aagcttaaga 50913 

atgctataca aatgtttgag gttttgatat ttaacttttt atgatatgct ttgatttaat 50973 

agttgcaaat tgccacattt ctcatgtcag ttactcgtat tctcccataa ataaataagg 51033 

cttcttctgt ctcaatttat tttacttcta aagcaacaat ttctttcttt ctcatttttt 51093 

ttgtaccctc tgatcagatc gtagtccgat cctcaagcct tagcctctac acactttgtg 51153 

ctcgaggctc gatgattgta tatgcttctg accggactgt agtctaatct ggtggtggat 51213 

cccaatctga tctgataacc tccacagtat tgtgcacatt ataacactga cgtagggtta 51273 

accatgtacc gagatccttg ggcacagcat cttgatacgc tcaattggcc tcgattcctg 51333 

aatatcggag atgggagagt aggcaggtcg caataaaagg tcataccatt aacgtaggat 51393 

ggtaggttaa gtaataatac ccttgacttg atgctctcaa ctaggtcacc ttgacatacg 51453 

acaaagcctt ttctaaaaca attttataca ttgaaatttg gaaatatgta tgcattgaaa 51513 

ttcacaacaa tacttttcta actcatatcc aaatttaaga tcagcctaaa agcccaaatt 51573 

aggggatgca aagaattcta tcgttaaatc aaacacatca gtcaaaaaag aaaacataca 51633 

taggcacaat caactgatca agtataacag tatttcagat attgttagcg actagaaggt 51693 

ttgtatttaa tttgcaaaat ttaatatgaa tactactggt tctatttatt ttataaactg 51753 

gtattagtaa tgaaacttta aaatggggaa tattttagca aaaaataaag gttaaaatat 51813 

gtctatgatt cttaatgaat attccaattt tatgtttgct ttttagtata aaaaaattcc 51873 



106 



WO 01/51627 PCT7US01/00552 

gtttttgttc cttgataaaa aaggaaattt gtttttggtt ttaaacactt tttttagtcc 51933 

ctattaaatt acgaatttcc tatttattct ttgataattt tttgtttatt attagtccct 51993 

tcaaaaatta ctaataataa atatttttta acaggaaata aatacaaaat tctctaattt 52053 

attaggaatt aaaaaaaatg ccaaggacaa aaaaattatt attttattaa gaattcaatg 52113 

caacaaaaaa tttacctaaa agcaaataaa aaatttgagt atttcttagg aattaaaaat 52173 

attttaataa aaataaaata aagatccaaa tgatagtgtg ataaccgaag aggaatgtct 52233 

ttcaaccact gcctgaccgc caccactgcc aacagcctag tatcaaccga atccacatat 52293 

accaacaatc ttcagacaaa cacttctaag ttggtgctga agagacaata tctcatgggt 52353 

agatcaaatt aagagtgcta ccaataacaa aatcgggatc atttgactaa caaacagtta 52413 

tgtgcattgg atgttctacc atagtacatt gctttatgtg aaagtctttt aattattcaa 52473 

tattgacatg ttcttatata tatatatata tatgagggat tgtattatct ctgaaaaaag 52533 

attttatcat aaaatcataa tgatttctca taatgtatct ttacatttta aagttagata 52593 

aataaaattg attttaaatt gttagatata attaaaatac ataattaata tgacttttaa 52 653 

caaattgata tataaacact taaaaaaaag tttcatgacg tacggtgtgt attgttggta 52713 

caaaaaaaat ttatactatc aactaattaa aattattata aataataaaa ttaataaaaa 52773 

ttactataat aatctgtaat tagattattg taaaattgtt ttataatata aatatacagt 52 833 

cttttttctt taagaaaaat tgctagacca agcaatatgg accatgtgct ttctgaaaat 52893 

atataacaca aaaattccat taagtttttt tgcacctata agctacatcc gctacgtact 52953 

gcatgtggag cctcatgagt gtgaggatct tccacaggtc actagtttga catctgaaag 53013 

ctcctcgtgt aaaacgtgaa aacaaataac aagcttggac tggtgtacga tttagtgtta 53073 

ctagctatcc catgtaataa atatataaat cttgaatcac aaggaatgat gcaatatatg 53133 

gttcctctaa tagtaagtta tcccaccaaa tctgaatata attaagaagt tgtattcgtc 53193 

tgaatgttgt gtctaaaagg gttgattgat gaatgatggc tacatgtgag agtttgataa 53253 

caacagctag ctagccatta gccaagccac taactagaca ttagttttgg ttggttgtca 53313 

gacaaaccgt tagacctgag aacgaaagcg tattaaacaa aagatgatat gtagactttt 53373 

aatataaaaa gagatggaga aaccaaattg agatttgata ggtgaactat aaatcatgac 53433 

agtgcattag acaagttggt agagtttgtt actaactcat cagattctta agaaaggcaa 53493 

aaatagaaac tacaccacat gtcgctagcg ataacgtgca atttataaat aaataatggc 53553 

ttcattttca tggttagtta taaattaatg ggtcacaatt cttaatttat taggaacgta 53 613 



107 



WO 01/51627 PCT7US01/00552 

tacttcattt tgagagtgta taaagttgga agaagaaaag ggatatagaa agaataaaaa 53 673 

aatggattta tctaattcat cgtaaatgaa aatgagatta aatcattcaa tcttcattga 53733 

ataatagaat ttaaaaaatt gtcttattct gaattgtatc attaatatta taactatcat 53793 

atttaatgta ttatctttct tatcatttat gtataaaatt aaaaatttat aattaaaatt 53853 

atattaaaat acataaatat atgtaataga attataaaaa ttaaaattac agctatatat 53913 

aatttcctgt cacttagact tgcagtagac aattgtttgt agttaaaaaa tatgaactgt 53973 

agtccggcta cgactattaa tttcaagact tatttaacag ttaataataa ttttttgtca 54033 

tctcgtcccc atttgtcttg cgtgttaata tatgttataa atagacattg atatattttt 54093 

ttatgtgtat ttgttgatac ataacaacaa taggattgta attcagattt taatatttta 54153 

ttacatgttt atgtatttta ctataatttt aattaacaat tataaagttt taattttata 54213 

cagagaaatt aaaaatgata cattaaatgt aataatcata atattaatgg acagaattca 54273 

taattagaca aatgtttaaa ttgtattggt tattgaagat gagatttttt attttcattt 54333 

ttacttgggg tcttagacaa acctaaaaaa aagagaaaat aacacatgta ataattaagt 54393 

gaagtaaata taatggaaaa agatgacgaa attaagatga gaagaaaatg ctatgagatt 54453 

gaaaagatat taagtgtctc tttatataaa actcaattaa ttacgagtta actggtcatg 54513 

catgaaagtg taaaacattt ttatgttatc atttaattat aaattatagt taatataatt 54573 

tttaaaataa ttatcataaa aattaataaa tttactgtac aagatgaatt ttaattaaat 5463 3 

agtgatataa ttttttttta cactaacaag tgtataattc tttatctctt taattattat 54693 

ctaacttttg attccacggc atgtcaatat tttctctctg accaaaataa ctatcaaggt 54753 

tagtaaacga atataaagac aaatccatca tgttcttttg tgtcaaaatg aggccttctt 54813 

aaagatcacg ctcaatgata tgtagttttc taagtcgcta aaatgcatgt taccctcatg 54873 

aagctataat agggttcaga gatagcttta gaagttcaat agagcatgtg gacctgggag 5493 3 

tgaggtcgta tgtcgctata atgctgtata aacttttggt gagcatgcat gaccatttta 54993 

ctactggggc ttccatagtg ggtttcagtg atagtcttca taagttcatt agtctttaca 55053 

agttcaatag agagaatata tggggacctc ggagtgagga ggttgaaagt cactataatg 55113 

ctgtataaac tttggtgtgt atatgcacca tctgatggcc atccaatgtc ccctagggac 55173 

aacagggtac ctaattaatt ggtaccacaa cggggagaaa atcaacacgt ttgtggaata 55233 

tacataccta gaattgaagg gctagctcaa tcaagctaaa cttgaattca actatagaaa 55293 

ttaaattaaa ttgaaatttg gttacacgag tcaggaccat tagttataat taaaatgcgt 55353 
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tagcacaatt tctaacgcta taggcataga agcactaatg gtgacacaca ctagtataaa 55413 

aatactttta atatcagtta ttttagattt ttttgtttgt gtaagtcaat caattttaaa 55473 

agttacttct aaatcaactt taacaaaaac taatgtagaa atgatctaga aaactttttt 55533 

tttaagttct aactcttttt catcaatgtt atacatatat atatatatat atatatatat 55593 

cccaaaataa ccaatcaaat aaactactta gtttacttat atgttaaatc atcgacctat 55653 

tacaagagag gaggtctaac tcagatgatt tattataatg ttaagttatt ttaaatcttt 55713 

taatattttt atttgatttc tatgaataaa aaaaaatcaa cttacccaac tttaaagtct 55773 

caagtcattg gatattatct tttttaaata tatatatata tatatatata tatatatata 55833 

tatatatata tatatatata tatatatata tataaattga acatttgttg ttatattagc 55893 

taatattgaa catttattta tatcttaaat aacatattat ctatcaaaat aaaatgttat 55953 

aacaaataat aattcatttt tatttgtatg aattttcata aatttattta ttttcaaaaa 56013 

ctttgaactc aatgattgta tgaaataatt tatattctta attaatgaat tgatcgtatc 56073 

ctatatatgc atagtatatg aaaaatcaat tctcttaaag taacataaag aggccctttc 56133 

gtttaagtaa ttgaattact tagacttcaa aaaaacaatt gaggggaatc aaacaattaa 56193 

cagataatta attctagcaa actatattct cctatatatt gtcaaatatt tgaaaattaa 56253 

aactgtaagt tatgataatg atattatagt tgaacacgtg gcagtatggt gacccaatgt 56313 

tcgtcaaaaa tcaaaatgat gtgtgcctgg tacagtcaag tacgtatctt tgagtatgac 56373 

attcttgtgc atgtaattct ttgctgatct tatccatgtg taaaataaaa ttagttatgt 56433 

tggatgacag gccacctagt ttaatacttg agaaatattt ttcataagta tctctaacta 56493 

aactcttgtt agatgcaatc aaatcacttt agttacttat agtactgtta ttttatttgc 56553 

tcacgcatca tggaagtcta cggtacttat agtactgtta ttttattagc ctaattatcc 56613 

atgtataaga atcattgaat aaatagttag ttttaccaga tagaaaataa aagagggtaa 56673 

ggaacaccca acctatcatg agagctaaag cttcacaaca agcaacgaac agcttttaac 56733 

cttaaactag gctaatgcca atattaaaga agaaataatt aaaattgtaa ggctggtcgt 56793 

gtataaatta aacaaaaggc cctctattca aaccttcata tatcatacct gtttttaatt 56853 

aacgcggact actttttcat ataaaaaaaa gatcattaga ggattaattt aaagcgtttt 56913 

agtttttaat taccaaagag aataattatt attaggcgct ttgtcccaca atcaatcacc 56973 

taaacaagaa aaagaaaaag aaaaaaaaag tcaaattgga ctaatgcaaa agtggcacaa 57033 

tctttgtctt gaactcttta attagcaaca aattatactc ttctgcacaa atcacaagaa 57093 
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taccttacat gaaaagaatg ttaatttgac ggtttacatt aaattatatg cagttttctg 57153 

caggtaatta attttcaaga atttaaggtg gttggtaatt ttcaatagct agcttgacta 57213 

gcaaaggaaa gaataaagtt aaaatgcttc ttgttttggc ctttttggat tgttatactt 57273 

tttgctaaac ggaaatggtt atatgaatgg taaaggagat aaattgttac atagtctaaa 57333 

attgttatag tcttaatcaa tcttcaaaca atatataata taattttatt aactattttt 57393 

tataaattaa aattttcaat tatatgtcac ataaaaagga attatgacca taaatatata 57453 

atagtcttaa atcatgtaat aatttgtcag caaaataaga gattaaataa agttagaagg 57513 

aacacaatca gtggattaat taaaacccgt catataaact aaattaaata tcaaatctca 57573 

aagtacgcat aaaaaatctc aatagatttt tgttaataat taactagtat tcttaaaata 57633 

ttgattaaaa aattaaaaga aaaaatatat ttattatata aatcagaaaa atataaaaaa 57693 

attataaata atacaatttt tttcctccta ataacaatat ttttatttga tgtccttaga 57753 

atattgggct aacatttaac atataattat acttgggaaa aatttactag aaatagtaat 57813 

aatagattct ctaacacttt ctcctaacat agtctatgat taatttaaat ttattgaaaa 57873 

ctatgaagtt atgagagaat tattattttt gtaattttta agaaatttca actagagaaa 57933 

atatgtttaa aagagcctgc tgttaaactt tctcaaaatt tattttcaac ctctaaccgc 57993 

agacttctga aataagcatt catgcacttt attaactagc gggtgcaaca actcctatgg 58053 

gtttggaacc aagttaagtt tccctttggg gtctgacctc aactaaatta acctaatctg 58113 

cctaacctca aaggacttta tctttccccc acctctaatc caccctataa aagcaccctc 58173 

tcccactctt acttgcattg caaccttaac cttcagcatt cacactaagg tgttccttgc 58233 

tcgccaaaag atcatggagc ctgccaaaac cattcacaac aatgtcaaat actcccccat 58293 

cttcttagcc atctttgttc tgatcttagc ttcagcattg tcttcagcaa atgccaaaat 58353 

tcacgagcac gagtttgttg tacattctct cactctcttt ctcttaattt ctctggactt 58413 

attttattct tgtttttttt aactcttttc cgttagataa taattaccta gctgttgatt 58473 

gtaatggaat aggttgaagc aactccagtg aagaggctgt gcaaaaccca caacagcatc 58533 

accgtgaatg gacaataccc gggcccaacg ttggaaatca acaatggaga cactttggtc 58593 

gtcaaagtca ctaacaaagc tcgttacaat gtgaccattc attggtataa tatcaagcta 58653 

gcatcttaac ttcattttag attatgaaga ccctttgact taattttaca catctgctta 58713 

ccacgattta tagacatata gatattagtc ttacgaaaaa catttttctc actttataat 58773 

tatactactc ccttcactcc tttttatatt aaaatggaaa gagtctctct ctaccatgtg 58833 
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aaaaataata tataaataaa aacatatata ctctaccatg tgaaaaataa tatataaata 58893 

aaaacatata tacacttttt acttcttgac attctaattt tttttacctt tctttctttc 58953 

ccatgatatt tatcatttat tctctcctct attctcattt atctatctcc caaggtgtgt 59013 

acattccatt agaatgtgaa aatgaaaaac attcacaagc ataatgtaaa aaaaataata 59073 

ttatttctca taaccctata tatatataca cgccacataa tacgtacgaa cgtaagtgat 59133 

actatcatga aagttcttga atggctttct tttcagggtg aatacatata ttaatggata 59193 

gtggtttttg ttggtcattg tttcttatta ttatgtcctt aggcacggtg ttaggcaaat 59253 

gagaacaggg tgggcagatg gaccagaatt tgtgactcag tgcccgattc gtccaggagg 59313 

aagttacacc taccgtttta ccgttcaagg acaagaaggc acactttggt ggcacgctca 59373 

tagctcatgg ttaagggcca ccgtttacgg tgctttaatc attcgtccta gggaaggaga 59433 

accctaccct ttccccaagc ctaagcacga gacacccatt cttcttggta ttttaatttc 59493 

cttcttaatt tactcatgca tatgcattat ttgtaattat agcattcatg gtaacatgga 59553 

ggcaactatc taaaaaagaa ttgagattat ttaaaaacta ataagtgatt gtgatagttg 59613 

tgattaatta attaatacta ttgaagcaaa gagacaatat atatagaaat tgtggttttc 59673 

tgttgtttaa ttttgctttt ggacaaagat taaacggtta aagtgatgat ggtgatgatt 59733 

taggggaatg gtgggacgca aaccctattg atgttgtgag gcaggccaca cgaactgggg 59793 

gagccccaaa cgtgtctgat gcatacacta tcaatggtca acctggtgat ctttacaagt 59853 

gctccagcaa aggtttgatt aattgcttct taatttcgat tgcattaatt gaacatgtca 59913 

catgtcttgt ttaaataaat ttactttgca aaatatttga cataattaaa acaggatatg 59973 

cagtcataaa aaagagaaaa cgacatatga tatgaaatta ttaaagatgt caattattta 60033 

tgaaacaagt caacaatagg ttgccttttg gtgcagcgtc tatttcatgg cctttctact 60093 

ttttgttctc ttttgaagac aaaagtgtct tccccaccaa ataaaaagaa aaaaaaatgc 60153 

agaagacttt taagtaaata tatagtttat aaattgcaag ttttagcaag aatttttaaa 60213 

aaatataatt tgatattttt tttcataaat tagaaagaaa aggctaacac tttttcttaa 60273 

aaataattat attataaaat tgtctattta aaaaaaaaac cattgaaatg acttaaagcg 60333 

aagagatttt atacgcgaaa cctgctttta atgattttat gcaaccaaca aggttgcctg 60393 

caagtcaaat ggaaaaaagg catttaaaaa cataaagtta atcaaacttt tcatttcttt 60453 

aatttagatg atgtatcatt ttaatttctt acattttctt aaaatattaa ttttatgcat 60513 

tttcaacata acttttttta tatattcaac taatgagaaa atatgataaa tataatttta 60573 
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aaataattat tgtaaaaata tttttttact ataaatatta atttgtaaaa aaatctttac 60633 

actataaatt atttactatt cttttttttt tacttcacat ggattccttc ttaaaacttt 60693 

catttttttt actcaaaatc tgtaaataaa tccaataaat tgttttaccc ttttggtttc 60753 

atgcagacac caccattgtc ccaatacatg ccggcgagac caaccttctt cgtgtcatca 60813 

atgctgcact caatcaacct ctcttcttca ccgtcgcaaa ccacaaactc acagtggttg 60873 

gtgccgacgc ctcctacctc aaacccttca ccaccaaagt cctcatgctg ggccccgggc 60933 

aaaccaccga cgtcttaatc accggcgacc agccaccttc ccgctactac atggcggcgc 60993 

gtgcgtacca atccgcccaa aacgctgcct tcgacaacac caccacaacc gccatactcg 61053 

aatacaaatc accgaatcac cacaataagc attctcacca tcatgccaaa ggagtaaaga 61113 

acaaaaccaa acctataatg cctccactcc ctgcttacaa cgacacaaac gcagtcactg 61173 

ccttcagcaa aagcttcaga agccctagaa aagttgaagt acccactgaa attgaccaga 61233 

gcctcttctt cactgtgggt ttaggtatca agaagtgccc caaaaacttc ggaccaaaga 61293 

ggtgtcaggt attggactat tcacctaatt ctattatcat gcatcaattt aatttgcatg 61353 

tacgtatctt atcttaagat ttcaataaat gtctcatata ggaaaaatta cttatttatg 61413 

tttataatcc ccacaaattt tacattttaa tccatactct taaaaattaa gtctaattta 61473 

atttcttatt ctttaaaaat gactgatatg ttctgatacc aaagaattca aatattaaat 61533 

atttttattt tttgtctttg tattctattt tttcataaat tctaatcttg ctaataattt 61593 

caattcatat taagatcggt aaatagaaaa tctagaaaaa aaaacaaaaa aagtattttt 61653 

ttttcattga ttttattttc aattgatttg tcattaacaa actgattcct cttaaatctc 61713 

acaaaagtac atgtcgatat aaatatgaga ttataaattc atgatatcta ttttcgattt 61773 

ttacatataa tgtttttttt atctttttta gttcctaata agcattttta aatgtcttat 61833 

gttcctactt tgcatatcag ggacccatta atgggacgag gttcactgcg agcatgaaca 61893 

acgtgtcttt cgttctcccg aacaacgtgt ccatcttgca ggctcaccac ctcggaatcc 61953 

ctggagtgtt caccactgat tttccgggga agccgccggt gaagtttgat tacaccggca 62013 

atgtgagccg ttcgctgtgg caacctgttc ccgggacaaa ggcacacaag ttgaagtttg 62073 

ggtcgagggt gcagattgtg ttgcaggata ctagcattgt cactcctgag aaccacccta 62133 

tccatcttca tgggtacgat ttctacattg ttgcagaggg tttcgggaac ttcgacccaa 62193 

agaaagatac ggcgaaattc aaccttgttg atccaccttt gagaaacaca gtggctgtgc 62253 

ctgtaaatgg atgggcagtt attcgatttg tggctgataa cccaggtaaa taaatagggt 62313 
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cttgttaatg ggcgtaacat tagttaggaa actaaatata aaaaatattt attgtatatg 62373 

atataagaga atgtaaaaaa atttataaaa aacaactttt catattttaa taaaaaaaat 62433 

cttttaattt cttaatcatg ttctaagaac acaaattaat atttaactag taaatattca 62493 

ctattgagtt ttaattagat aacactctag aaatattatt tatttatata tgaatatata 62553 

ctttctttcc agttttcatt tgtataatgt cctttttata aaaagaaaga aagaagcaaa 62613 

ctcaatcatt tgatgggtgt gttaattgtt aattgtaggt gcatggcttt tgcattgtca 62673 

cttggacgtt cacattggat ggggtttggc tacggtgttg ttggtggaga atggagttgg 62733 

gaagttgcaa tccatagagc ctcctcctgt ggatcttcct ctttgttagg atatcatttc 62793 

aaaatattcg ttggccccca acaatcggag ttttgcagtt tttcttagtt tggaagctgg 62 853 

ttgatgcttc cctgcattaa ttttgggagg gttttttgtt ttgcttcatt gctttgtttt 62913 

aattgtacgt tctttttcta gagagagata aattgggttt ggaacctagg agaggtggtg 62973 

atgatgcggt ttcaacgcaa ctataccatc tgaacagtca ttgcccaact agttaaattg 63 033 

atgttatttt ccccccatag ctcaagttac aaacagattg caagttttga atatcaatat 63 093 

atcagctggc ttttttttgg gttgcaactc ttggtctctt gaatataaat atgttgacat 63153 

gtattaagtt tcaaagtccc ctaaccattt gccatcataa actaatacaa aattctcaca 63213 

ttaattcaca gtatagtgta tacagaggga tgcttatcat agcatcaagc attaaaatcc 63273 

ctcaacgaat gttgtcaaca tatggattat gggataaaaa tataactacg cttttctgat 63333 

agagggtggc ttttggtttt catcaggctg aatgtgcaag ctgatagagg agtattaaga 63393 

tagagtagac tattgtaatt gagaagcaga aagctactgt cgctgatcat gtgagtgggt 63453 

ggcaaatgta ctagtccagt agtcctcttg agttttagcc caataatctc atcaaatgaa 63 513 

tctctgggcc tgcttcatta atgctaaaag ttgcctcttc ggcctatgct ttttttttct 63 573 

ttgttaataa agtgatgaaa ataaaggata tattcataaa ctttatttat gtttactttg 63 633 

tataaatttc ttaacaaata cttattgaga aaaaaaaatc aaacttctct ,cataacataa 63 693 

aattaacttg tgcatttcta actttttaga agttttctta tttaacttcc ctaaaaaaca 63753 

ttacatttgc tacagtaact tggtgacaga tttcatgtta gctctcgcat gatttcaatt 63 813 

gtattacaac atttgtactc tcagttatcc ttaataaaat gatatgattt tccttgccta 63 873 

aaaaaaactt ggtggcagat tttgttgcat agattaattc atctatgtgc ctttaggcat 63933 

ttgctagtca tgatggtcta taaatgcagt ctttacatgt atatttctag attttaacat 63993 

tgttgggttc tgtactatct tttggggtct ttatttggat tcaccttgtg cttaaaccta 64053 



113 



WO 01/51627 PCT7US01/00552 

aaggctattg atcattttat tctttttctt tttgaatttt tttcattaaa gttttttctt 64113 

tttataagtt gattttaatt tttagaagaa acttaattca ttthctctct tctttttcct 64173 

cctgatgaaa aattattcaa attgactttt aaaaagattg tatcatatat atgtgacatt 64233 

ttctttttca ttatttttta tgttcatgtc acataaaatg caagtctcat aaaatgtggt 64293 

tgtaggacaa aaagtgatac tatactccac atggtatata taccaaaata aaagtaatac 64353 

acggaatcat gtgaagccta ctcaagtaga tgtgcaaaat cttgtgaatt caaattagtt 64413 

gtcttgttta ttcattacct tttcaatttt tttaatcatc ataattaagg cctttcgaat 64473 

ccctttaagt gataaaagaa acgtgcaatt atgcaacaaa taaattttcg ttatgttact 64533 

atttagtcaa ggaggaaaaa aaagtgataa gggaagaaac aagggatatt tcctgttata 64593 

acaaacttaa aatggcgact attttgacga cattgcaaat actcatagta cgatataaat 64653 

tttgaattta atatacaatg aataggcata ttcattttct accccaaaaa agcatactca 64713 

tttatgtaca tttaattttc tctccataga ggaattaatg tacaaccatg cataagggat 64773 

gagcaaaagg gacagattat tgcaatccag aagcatccaa ggaaagttgg ataaacaaat 64833 

caattaatat atataaaaaa aaaacaaaaa tgctcctagt aaaagattaa aggaagagtt 64893 

ggctatatat ggcaaacctt ttctaactgt tttaccctct tctcatcacc gcattgcatc 64953 

accaatacgg gaactttccc attacaaaac tcattggaag ccaacatatc ccccaaaatt 65013 

cccaactgat ctgcattgtc catgaaattt gacatttctt cttctacaaa attccccatg 65073 

ctatgtcgtt ttcccaccat cactaggtca tagtccttct ccattccccg aatcgctttc 65133 

aacacttgta tgcaatcttc caccacagcc tcatggtaaa caacgttaac actatcacta 65193 

ctaatatcat tcttggcaat aaactcatct attagactct cgtccaacgt gctctctaaa 65253 

ttctcatctt cattttctac aagtccatta aacctcgaat cttcattggt aggcaaaaca 65313 

aaccggaaca aggtgacacg tgtgttgggg cgttccaaca ttcgaatccc caacgccaat 65373 

gcttctctat catcttttcc accaatgaag aagataccaa catcaaaaga caacttggag 65433 

ctactcccac ttagcaccga gtatctatcc actagtatcc ccaaagtacc ctttgcatta 65493 

gcgagaaaat ttgtgttgag gttgcgaatg gtgctggcca ggtggcttcc tagggtttga 65553 

tcattttggt gaaagggtat aatgagaaga tgcactgagt tatcttcggc taggttacaa 65613 

actgcctcat gcatgcttct gtacggagcc acgttaacat acgaaagaac cgttacagga 65673 

ccacttgagt tgttggagta gttctcaaag gcacgtaaaa tgtggttggt gttggggtag 65733 

tttacggaca aagattttcg cttgtttttg ttcatgggaa gaagaatggg tgtgcttttt 65793 
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ccaacgagct caatgagatg gaccacgtag acgtacaagg ggctttgtgt ggtagggttg 65853 

catgcttcta ttagggcaat catgttgtgc acgtgttcgt ctgtgtgcac acatgatacg 65913 

atgttgaacg gtgtattttc ggttatgttt tggattgttt tcacacaccc ttcttgtatg 65973 

gtttgcgtct tacacaccct acggtgcctg tacaaggatt tgatcaaggg tatgcaaatt 66033 

gatgtcatga ccaccacaga catcaccgct acactgaata cttctgtatc aataacctgc 66093 

aaaaaggaac gtcatgacat aaattagatt taatatttca ctaacaatat gtatagcacc 66153 

cacattaaaa agcatgagag tattgaaatt caaatcgctt caaagaaatt atcatatgac 66213 

ccagaactat catgtattct tgattccgat tgtcaatttg tattacatgg catataatca 66273 

attttattat ttttttatat atgaaaattt tgattatata tatcttgacc gaaatcgtag 66333 

acatgtagtg atgtcaaatc ataaaataat tccatcaatt aaaaacttac atgcataatt 66393 

aatatttttt aattgagaga cagaaattga gaaattatta gacaatatga catcatcaca 66453 

tgattatgat cttattcagt ttctacatga tatataatca tgtttttttt tcttaattta 66513 

caaaagaaag aataatatat atgactttat tatgtgtcat atagatgtta ataagaatcg 66573 

tgaaaatttt agaccacttg atcatttctc gaggaaagag catgaaaagg agacaagcga 66633 

tttgatttct acctctcata aaatagaaga aaatatataa gagaatatgc attaccctga 66693 

gtttattcat tctaccgtaa aagatgagtt cgactatacc cttgacattc aatatgagcc 66753 

caagcacgac gccatgtttg ggcttaatat tataggtggg agaaatgagc gcacatgcaa 66813 

gtaccttcac caagcatccc acaaacaaga tagccagaac tactagaacc acctcccaat 66873 

gctcatgaat taaggtcaag tctgtccttg tgccaatcaa caagaagaag aaaggcataa 66933 

aaaactcata cacgatcaat tcacttctct ctatgattgt tgtggcgagg gggggtccat 66993 

ttggcagaac caaaccatag agaaaaggtc ccatgacgaa atatatgccg aatgtgtcac 67053 

taatagctgc cataaccaaa ggccccagaa gtaacaagac aacgtatgct tctttaattg 67113 

gcttcccagg tggtgttctc tccaaaacaa tgttcactag tggtcgtata ataagaagta 67173 

ataaaacagc aaatccagtt gcacctatca aaagcacgat tagaaatcgc atgctgaatt 67233 

ttgagttgaa caggagttcc atcgtagtcc attgcaatat ttcactgatc atggctgaag 67293 

atagagcaat ttggccaagt tctgtggcta caaggttgag ttccatcaag gtttcggata 673 53 

caacagcgaa actgctcaag gtgaagatgt tcgggaagtg gtagattgac atttggtttt 67413 

gattagcatt accgttggga gaatacaaag aaaacaaggt tacagtgacc aagaaagaag 67473 

ccaagaaggg aaacacaccg aatcgccaac aacgtttcgc tgattttaat gttgtcacca 6753 3 
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cgtccatttt caaacaagtg aggaacacgc aatatgtagt gcctattttg gacaacgtat 67593 

ttagaaacag ggattgtctt accggaaaga gggctcccaa tatttcttca tggcgcccta 67 653 

agaatgttgg ccccaacaga atgccagcct gtaaaattta aacgaaatta agattattta 67713 

tgggtatgtt ggatagaaaa aacaagacca ataaatataa aaatgattat atatgtgtat 67773 

gttttctttt tttgtgggaa aaggaacgat aaacttctaa aaaaacctta agaaaacacg 67833 

cttgaagaag cgtctgccat aatgccgtta aacaagatag aatgatcaaa cttggtcagc 67 893 

tcgtattgaa cactagtaat taaaattact aataaattaa cctatatatg agaaatgtag 67953 

aatatattag gagtgaatgc ttacgatgac gcagcagatg aatttgggtg tattgatagg 68013 

cctgaggaca taatgaaggg ctttagagag taaactgact aaaatgattt ggaacaatgt 68073 

gacaggaacc acaaaatcga atggattatc acccatccac acgcctaaag agcctacagt 68133 

tctatcattc ttgagacaaa ctatcatttg acccgttcta gcgtcattaa agaccgtggt 68193 

aatggaactt tcactcatta tatatatgtt tgtaccaagc aattagtgtt gacaagaaaa 68253 

acaaaaacct ttctctcttg ggttgttggt gtttgcaaca aacgcaaaca cgagcactac 68313 

ctgagggaga acaaaagtag ataggagaaa agcatatatg tgcattttta tatgctctaa 68373 

tcaaattggc agataaaata attatattta acaaaatatg agctgcgaat gtggtgttag 68433 

ttattacaac ggttcaaaaa tcaattatag caatgataac tttagtcaaa tgtattccga 68493 

gttggagttt cgagaaattt tggctaggaa agaaaattga tatgctatca ctgagctaag 68553 

gccagtgtgt aacgaaaaac caagaagagt gaatttcaat ttagtctctt taaggtataa 68613 

ttttaaccaa ccgtgtcaca aatatcgcac aacaaaagtt tgtgtacaaa tgatttaatg 68673 

aagacaggtg tagtcaacga gaataactgg ttacaaccaa caatgacata cggaatagtg 68733 

gcagggatga tcggaaaaaa aatacagcaa ataggcaata gtaaatttta atttgtgcgc 68793 

cgtttcacga taatattttc ttgtagaaaa ggattctcct tgcaatagag gagtacaaaa 68853 

agatttgtac aaactactac tagcatttaa taataaaaaa taattttgat aacaaaatat 68913 

aaattaaaat atgaaaattc atgcatttaa aattttaagt gttgaaattt ccttaaaaaa 68973 

tagtttgatt gattgatata atatttcttt attttgatat atatatatat atatatatat 69033 

atatataacg ataattgata gtaaaatttt acttttactt ttaaatacat atataaaaat 69093 

taaggaaata gtatgtgatt aattttacta cttttctata atatagttga aaataagaaa 69153 

taaaatcaag acacgtactt gtttgctatt attatttaaa ttagaaaatg aaataaatat 69213 

tttattgaac ctaaagtgtc tttaatcatt tagcatggag atcatttgga aaagttgtta 69273 
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taatttaact agatgtccgt ttgaaaaagt ttatgtgtat cttaaagtta tattattgta 69333 

taaagtgtat gtttcgaaga acttattaaa ataatttata atatatatct atgataaaga 69393 

ctaacaataa ctaaaaaaaa ctttgcatca cattgttaat cttttacatt aatttaaaat 69453 

ataattcaca tattttattt tttatttatt atgaatttta attataatac atattcaaaa 69513 

ttatttattt attataaatt ttagttaaat aaaataaaca tttatttttc aaattctaca 69573 

agctaataaa ctagtattac taaatgcata gagtatgagc aaattcgtgt aggtggattt 69633 

tgtatattat atatgttttt cgttaaataa ttattactcg actttagaag gttgaaaaag 69693 

gtcataaaaa tttagctatt atgaatctaa actttctttc tatttttctt ccgatccttt 69753 

tgttttttca tcacattagt gatcgtatat gttatttttt ctcgattttt ttcctatctc 69813 

tttattcctt ctacctaata cactaaaaat gatatatata catttttcca caaagactcc 69873 

tgcccttttc tatcagctaa aattatttat gtacaaataa aaaaggtaca aacacaacat 69933 

ttatttatga acagataaac gtttttgtga gacattaact gaacctactc tatcaagctt 69993 

attattacta ctactactta tcttcactcc accacactgt gtcactaaaa ccggaaccat 70053 

ccccatacaa aattctactg aagacaacat atcccccaat attcccaatg catcagcgtt 70113 

ctccatgaaa gttgtcattt cttttccatt caaagatcca tcattgtggc gccttcccac 70173 

catcacaaga tcatagtttc cttccaaact atgcactgct tccaacacct ccaccccatc 70233 

gtccaccgta atctcgtacc aacaaacgtt accaatgcca tatttcatgc tcttgaactc 70293 

gtcaattaac ccctcgtcca acatggtatc ttcctcttcc tcttcacgct cttctcttgt 70353 

caaaataatt ttacaaccac acggtttctt gttcacgata acaaacctaa acaagctcac 70413 

cctcgtatct gcacgctccg acattcgaat tcccaatgcc agagcttccc tatcgtgggc 70473 

cccacctatg aagaatacac ccacgttgaa atacatgttg ttgttgttgg acgcgcccag 70533 

ccgagagtgc cggtccacga gtatccccaa cgtgcatggc gcatgcgctt gaaacctagt 70593 

attcatcttc ctgatggagg ctgccacgtg tccgacaagg tcaatgttgc cgttttcgtg 70653 

aaaagggatg ataatgaaag gcaccatatt gtcttgggcg aggttgaaaa tggcgtcgtg 70713 

catgctcttg taaggtgcca cgttgatgta gggaagaacc ttgactggcc cacttgagtt 70773 

gttggagtag ttttcgaagg cttgcatgat gtggttggtg ttggggtaat tcacagacaa 70833 

gaattttctg cgaccgtgtc tatgttttat gggaaggaga atgggtgcac ttttccccac 70893 

gagctcgata aggtggactg cgtagacgca tatggggctc tcttgcactg ggttgcactc 70953 

ttctaataag gcagtgatgc cacgcacgtt tgcttcatta tgtacacaac aaacaatgtg 71013 
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aaactctctg tttcttggag tgctttggat cgttctcagt tccccttcga ataagctttc 71073 

tgcgtgtact cgagggcgat gcttgtacaa tatgttaacc aagggtgtta caatcgcggt 71133 

tataagtacc acacagaaca ccaattgact gaatgtatcc tcatccaaca tctacaaaac 71193 

gtaataataa cataaattag atttgatttg atgtattgta catacaatgc attttcaatg 71253 

tttttttaaa tgaattagtg cttgagttta tatttcagtg ttattttaat ccccaaagtt 71313 

atggaagagt caaataaatt ttgatttttt tatgatttaa ttcgagtcct tcaaattggc 71373 

gtcttttttc tcgttttggt tcccagtact taagtcataa taataataat aataataata 71433 

ataataataa taataaaagt ttaggaagaa aaatgagaaa actaattagt ttcgggtttt 71493 

atttaaggtt tttttagttt cagaaactaa aatgacatat agatacaaat ttaaagatta 71553 

aattcgtcat ttactcattt tttattagac aaaatttagg tttaagcatg catacctttt 71613 

gtttcttgaa tctagccaaa cttatgagtt gagttatacc cttgatgttc aacatgaggc 71673 

caagtagcgt gccatgttta ggtctaatgt tgtacgctag agaaaccaac acacaggcca 71733 

gcaacttggc caagtctccc gcgaagaaaa caccctgaag ggtcaaaaat aagcgccaat 71793 

cctcaagtgc agacaaatcc gtgttgatgc caatgtacac aaagaagaag ggtagcaaga 71853 

actcagtggt aagaacttca ctcttctccg ctagcgttgt ccctaagggg gggccacttg 71913 

gtacgaccaa accaaaaatc aagggtccca tgagaaacgt tacgcccatc aagtctccca 71973 

caccagccat cactagcacc ccaagaagta tcaacacaac ataaagttcc ttcacaggtt 72033 

ttccaactgg ggttctcata gcaatcaatt tcatggttgg tcgaagaaca aagaagttga 72093 

agaagacgag caagcaccag tttcccaaca atgcgattga ttttttcaca tcaacattag 72153 

atgtgaagct atgcataacg atgaaaagcc atagtatgat gtcgttgatc atggaggaag 72213 

agagagcgat ctggccaagt tctgtggctg tgaggttcag ttcaagcatt gcatcagaca 72273 

cgacggggaa gttgctcaac gacattaagc agcttactga gacacgcgcg attgttaaag 72333 

atgcagagga aatttgttgg gggtgataat agagacataa gagtgctaag ataaccacaa 72393 

atgaagcgag gaagggaatt actccaagtc gccacgtgct tttggctgct cttatggtca 72453 

ttagtacgtc cattttcaat gcaactaaga atacgaaata tacggcgccg gttaaggatg 72513 

ccatcactaa atactccgtc tgtctcggtg gaaataaaac ctgccaatac gtcttgttcc 72573 

gccccaaaaa tgtggggccc aaaataatgc caccctgcac attttcacta gacacaactt 72 633 

taacattcaa actaaaccta taattataat ttacaaatca tacaagaatg aatttcttct 72693 

gctagaatta acagtttcaa accttaaatt atactaatag atggttaaga tttctactta 72753 
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aaatcatata tatcatttgc tttcgttttc aatgctaaaa tgatgtcagt aataagcaac 72813 

agtaataatc acattcatga taatagtgat aaaatgatca ccacagtaat atcaatcaca 72873 

atagtcaaaa taataataat aatgaccttg atgtgaaaac tgctaaagtg aattttatat 72933 

aaggaaatca ttctcatata gaaatgataa aattacttat tatgagaatg aaaacaataa 72993 

attcttattt gaatggttag atttaaaaaa tacatcactt cttattaagt ggtcatgtgt 73 053 

gaacattaaa ttacctttaa tctttatcat aattactctt ttcattctta aattaagatt 73113 

ttttttctaa tttctagata tattaattat ttttttctta aatattctta cttaattatt 73173 

ttctcatcaa atattaatga gatgaataga gaaataagaa aagaataatt tttgaatgat 73233 

aatataatta attaattaat aaatttaatg tgattaatta aattaattat ttttcttaag 73293 

acacataaat tagttgaaag gtaattgtaa taagggacag acggagtgaa ataaattgct 73353 

actaatttca taaataaaaa accttcaaac tatgagcaat cgtggtcatt gaaaacaaca 73413 

aagaaatgaa atcctaactt tttatcatga gataaatttt cttggcaaaa tgatagacag 73473 

accaaaatgg tgtcttccat gtctctcctc cttccacaaa caaacgttac cccatacctc 73533 

agacactcct cgacaactat tggacttgtg gagtccaaca ttggactctt tgaagaataa 73593 

ttcggaacag cctcaactac cttcaatttg gaactataaa ccacgttatc acactgcctt 73 653 

gtcggagaag tgcacctatc cagcttatgg cgctttcggc tgttttccag cggcgagact 73713 

aacacattaa tagttatgta tttcaagggc aaaaggtcca aactctagcg tgtggaaggg 73773 

cgtgaggtgg attccatggg agaagaagtc tcgcatcatg ttgaggaata aatggtgtat 7 3833 

aaggagaagg ctaagcgtca tgtttgagga gactaatttc ttgaagacaa tttcaagaaa 73893 

atcatctttg aatatttgat ttttaagaca gttttaagaa aatcatcttt aaacactcaa 73953 

tttttaagat gatttttata aaattgtcat catacatctt ctattattta caaaattatt 74013 

actgcctaac attttaagac aaatttttaa aattatctta aaaaatacgt tgtaaaaatt 74073 

attttttagt agtataatca tatttgtcac tgttttcatt atcattatta tcaatatttt 74133 

tattgttatc actaccatct tcacctatca ttaacatcat cattccatcg ttgtcaacgt 74193 

gagaaggtgg tagcttgata ccgacatgat gacaatgatg acgatttgtg ataattgttg 74253 

caaaaaaagt gaaaagaaaa taaggggtta aagagagaga gaaaattgta gcctttcaat 74313 

atttttttaa atttattaaa agggaaaaaa acttttatag ctagcttacc aggacgttgc 74373 

agatgaattt gggtgttctt agaggcctga gaagaaagta aagcgttcga gagactaaga 74433 

tgacggtgaa gagttggcac aacgtfcacag gaagcacaaa ttcaaatgga cgatccccaa 74493 
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tgaaaattcc taaagagccc acatttctat cgtcttcgac acaaacttgc cactgcccat 74553 

gggaatccca ataacttgag acaattccat tgcctcgtga agtcgccata ttatctctat 74613 

atatatccaa ctaatcaaat tagtatttct gtgtttgtgt ttgcatgcat gatgacgcaa 74673 

cacaaacacg tacccaggga cataaaatgc tgagtggagc acgcaatgat atgaaaaaac 74733 

aatgtcttaa ttttttgttg ttgagttatg aagaacaatg tctgaatgaa acaaatgaaa 74793 

aaagaggagg ggaggggtct tagaagcttt agattaggaa tcggaatatt cgtcaacgcc 74853 

aagatataat agccttgcat tgcatgttga tattttcgtc aaagcatgag agttaatggt 74913 

gttacggggg cactaatcta atattatagc cactcgtgta actttaaaaa aatttctatt 74973 

ttggcaaaat ataatatcta tatatggatt gagaaattct acttaaaact agcgtaccac 75033 

taaaattaat ctcatacgaa gtcaacactt gaaagtttaa agtaactaat taaaggttgc 75093 

cctaacaatc ataggccata agcccttagt gggccttgtg gtgtgggcca ttttttaaaa 75153 

agtattgtag ggacaaaaat aaccttaatg aaagatggga gtgaatgact tttacaccct 75213 

tgctcatctc catgtttttc catagcttcc aatggtgttg gacttttgtg gtaattctat 75273 

ttggtgttgg tcagggaata tggaggcaag tgttcttctt ggaaacagtt atggggtggt 75333 

agctcgcgtg gctgtggcgg gtcttcgtgg tggagtgatg ggttcggcca tatctcgtgg 75393 

ggctaaagaa caaaaacaaa aataaagctc tgatgggttg gaaaggtgtc aaatacactc 75453 

agcatattta ataaattatt attaaaaaaa ataccaacaa agttgcaagt aacatatcct 75513 

aactcttaat taataaaatg cacaatctaa cagtttatat ttcctttatc actatgaact 75573 

atctaacaag attcaccatc ttaaacttca tcatttgcac gaacaaaagt atttaattta 75633 

ttttatctca taaatttcta tgaacctatt tttggagatg ttatatggtg aacaaaaaat 75693 

attactaatt cactaagttt atgtgcaatt aatttttagt ggagtcaatt tatctttaaa 75753 

atgagtcaaa ggacctattc tatggcaatc aacccttttc attagcttct attgttctaa 75813 

tatacaaatt ttaaatccta gtttttggtc tcaaatgttt ttaattaaat ttgttaaatt 75873 

attttttatt taaccttact aatctaaaca tcatggttat atataaatgc aagcatattt 75933 

attaagataa tattttaaaa aaattatata aatgaaaatg atataagatg tatatgaata 75993 

tataattatt taaatatata tatcagcatt attaattttt tatattttta atattaatat 76053 

aaatatattt ttataaaaat aaaaataaat tacaaatcgt aataatattt ttgtaagaaa 76113 

aataataaag ataataaaat ttacaaatat tatattttta aaaaattgca taaattattt 76173 

aatacatgtt ataatatttt aaatattcat cacgtttaca aatattttca tcatgtatac 76233 
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